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Abstract. Let Xi,X2, ... be independent identically distributed (i.i.d.) ran- 
dom variables with EX^ = 0, Var Aj, = 1. Suppose that ip(t) := logEe tXfc < 
oo for all t > —(jo and some <tq > 0. Let Sk = X\ + . . . + Aj, and So = 0. We 
are interested in the limiting distribution of the multiscale scan statistic 

M n = max 

0<i<j<n \J j — i 

We prove that for an appropriate normalizing sequence a„, the random variable 
— a n converges to the Gumbel extreme-value law exp{— e~ cx }. The be- 
havior of M n depends strongly on the distribution of the X^'s. We distinguish 
between four cases. In the superlogarithmic case we assume that <p(t) < i 2 /2 
for every t > 0. In this case, we show that the main contribution to M n comes 
from the intervals having length I := j — i of order a(logn) p , a > 0, where 
p = q/(q — 2) and q £ {3, 4, . . .} is the order of the first non-vanishing cumu- 
lant of Ai (not counting the variance). In the logarithmic case we assume that 
the function ij)(t) := 2ip(t)/t 2 attains its maximum m* > 1 at some unique 
point t = t* S (0,oo). In this case, we show that the main contribution to 
M n comes from the intervals of length d* logra + a-v/logn, o£l, where 

d* = l/(p(t t ) > 0. In the sublogarithmic case we assume that the tail of Xk is 
heavier than exp{— x 2 ~ £ }, for some e > 0. In this case, the main contribution 
to M n comes from the intervals of length o(logra) and in fact, under regularity 
assumptions, from the intervals of length 1. In the remaining, fourth case, the 
Xfc's are Gaussian. This case has been studied earlier in the literature. The 
main contribution comes from intervals of length alogn, a > 0. We argue that 
our results cover most interesting distributions with light tails. The proofs 
are based on the precise asymptotic estimates for large and moderate devia- 
tion probabilities for sums of i.i.d. random variables due to Cramer, Bahadur, 
Ranga Rao, Petrov and others, and a careful extreme value analysis of the 
random field of standardized increments by the double sum method. 



1. Introduction and statement of results 

1.1. Introduction. Suppose we are given a long sequence of observations. The 
observations arc assumed to be independent identically distributed (i.i.d.) random 
variables with zero mean and unit variance, except, possibly, for a short interval, 
where the observations have positive mean. This interval may be interpreted as a 
signal in an i.i.d. noise. The question is how to decide whether a signal is present 
and if yes, how to locate it. A natural approach is to build a multiscale scan statistic. 
For every interval we compute the sum of the observations in this interval divided 
by the square root of the length of the interval. Large values of this normalized sum 
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indicate the presence of a signal. Since no a priori knowledge about the location 
and length of the interval containing the signal is available, we take the maximum 
of such normalized sums over all possible intervals of all possible lengths. Scan 
statistics with windows of fixed size have been much studied; see, e.g., [121 113) . 
The scan statistic we are interested in is built using windows of all possible sizes. 
In order to use this statistic for testing purposes we need to know its asymptotic 
distribution under the null hypothesis. 

We arrive at the following problem. Let Xi,X2,--- be i.i.d. non-degenerate 
random variables with = 0, VarXj, = 1. Consider a random walk given by 
Sk = X\ + . . . + Xk, k G N, and So = 0. For n £ N define the multiscale scan 
statistic M„ by 

(1) M„= max - V \ §. 

0<i<j<n y/J — I 

Following results on the asymptotic behavior of M„ asn->oo are known. For ran- 
dom variables with finite exponential moments, Shao |35j . confirming and extending 
a conjecture of Revesz [33] , proved that 

(2) lim -%L= = a.s. 

n^oo y/2 log n 

Here, m* £ [1, oo] is a constant determined explicitly in terms of the distribution of 
X\. Shao's proof has been considerably simplified by Steinebach [35]; see also [TS] 
for a multidimensional generalization. This describes the a.s. rate of growth of M n . 
But what about the limiting distribution? In the case when X\ 1 X2, ■ ■ ■ are i.i.d. 
standard Gaussian, Siegmund and Venkatraman |36| showed that for all t£I, 

r , 5 log log n + log tA= + r n 

(3) lim P M n < y/2\ogn+± 2V?r = exp{-e~ r }. 

n-s-oo L V 2 log 71 J 

Here, H > is some explicit constant. The distribution on the right-hand side is 
the Gumbel extreme- value law. An independent proof of the same result was given 
in [T7] . It was shown in [THl [T7] that a result similar to ([3| , but with a different 
normalization, holds if we replace the Gaussian random walk by a Brownian mo- 
tion. Generalizations of both results to the multidimensional setting with intervals 
replaced by cubes or rectangles, have been obtained in [18]. Similar problem for a 
totaly skewed a-stable Levy process has been considered in [16]. In the case when 
X\ has regularly varying right tail, limit Frechet distribution for M„ has been 
obtained by Mikosch and Rackauskas [2^; see also [2"5] . 

Apart from these special cases nothing has been known about the limiting dis- 
tribution of M„. Our aim is to settle this problem for a broad class of random 
variables with light tails. It turns out that the behavior of M„ depends heavily on 
some fine properties of the distribution of X%. We assume that for some a > 0, 

(4) <p(t) := logEe tXl < oo for all t > -cr - 

The function ip (called the log-Laplace transform of Xi) is strictly increasing on 
[0, oo), strictly convex, infinitely differentiable, and vanishes at 0. We will consider 
four cases depending on where the supremum of the function 
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is attained. The constant m* in Shao's result ([2| is determined by m* = sup t>0 ?p(t). 
Note that limtj,o tp(t) = 1 since ip(t) ~ t 2 /2 as t J, 0. Hence, m* > 1. If Xi is 
standard Gaussian, we even have tp(t) = 1 identically, for all t € K. Our four cases 
can be roughly described as follows, see Figure [T] 

(1) Gaussian case: ip(t) — 1 for all t€R, 

(2) superlogarithmic case: the supremum m* = 1 is attained as t J, 0. 

(3) logarithmic case: the supremum to* > 1 is attained at some £ = i* G (0, oo). 

(4) sublogarithmic case: to* = +oo 

Since the Gaussian case has been fully analyzed in [36j [17j [TS] , we concentrate on 
the remaining three cases. Let us explain the difference between the cases. The 
definition of M„ involves a maximum taken over intervals of different lengths 
I := j — i. It turns out that different lengths make different contributions to M„. 
In all three cases we will single out some family of lengths which are optimal in the 
sense that the contribution of all other lengths to M„ is asymptotically negligible. 
We will show that the optimal lengths are given as follows: 

(1) Gaussian case: I — alogn, a > 0. 

(2) superlogarithmic case: I = alog p n, a > 0, where p > 1. 

(3) logarithmic case: I = rf* log n + aylog n, a£l, where > 0. 

(4) sublogarithmic case: I — o(logn), and, under more assumptions, I = 1. 
Here, p > 1 and d* > are parameters depending on the distribution of X± . To give 
exact meaning to these statements we will analyze the random variable M n (fei, /12) 
obtained by restricting the lengths over which the maximum is taken to some range 
\h\,h,2\. Namely, for < hi < 1%%, define 

(6) M n (h 1 ,h 2 )= max ^=M. 

0<i<j<n y/j — I 
hi<j — i<h2 

1.2. The superlogarithmic case. Here we consider random variables which are 
in some sense dominated by the Gaussian distribution. We assume that for all 
£ > 0, 

Equivalently, i/j(t) < 1 for every t > 0, and limsup^^ ip(t) < 1. A closely related 
notion is the subgaussianity; see [7]. The first non-zero term in the Taylor expansion 
of cp(t) at is t 2 /2 since we assume that EX*, = 0, Va.r Xk = 1. Of crucial 
importance will be the second non-zero term in the Taylor expansion of f(t). We 
have, for some q € {3, 4, . . .} and n > 0, 

(8) <p(t) = ~-Kt*+o(t*) t no. 

Thus, q is the order of the first non-zero cumulant of X±, not counting the variance. 
The most common value of q is 3, however, for symmetric distributions the third 
cumulant vanishes and we typically have q — 4. Note that the coefficient k cannot 
be negative, since otherwise ^ would be violated for sufficiently small e > 0. 

Theorem 1.1. Let Xi,X 2 , . . . be i.i.d. random variables with EX^ = 0, VarXj, 
1 . Suppose that conditions Q, Q, Q are satisfied. Then, for all reR, 

"1 13—6 1 

(9) lim P -Ml < log(nlog5 ^ n ) + T = exp { _ A e - T } } 




Figure 1. The graph of ip(t) — 2ip(t)/t 2 in the Gaussian 
case (black), superlogarithmic case (blue), logarithmic case (red), 
sublogarithmic case (green). 



whe 



V. = ^r(^)(2«)^ 



It turns out that in the superlogarithmic case, the main contribution to M„ is 
done by intervals with length alog p n, where a € (0, oo), and 

5 3 7 

- ; :■(. •>. 

q-2 

Moreover, we will even prove that the "intensity" with which the length a log p n 
contributes to M n is given by some explicit function A 9iK (a). Namely, we have the 
following result. 

Theorem 1.2. Fix arbitrary < A\ < A 2 . Define l~ = Ailog p n and = 
A2\og p n. Under the same assumptions as in Theorem ]! . 1\ for every r G BL 

(11) lim P 1+) < log(nlog5-f^n)+rl = cxp{- e - T f A q ^(a)da\, 

n-yoo 12 J I J Ai J 

where A giK : (0, oo) —¥ (0, oo) is a function given by 

(12) A g , K (a) = — l —exp{-K2ia-^}. 



Note that A qyK = f^ A q lt (a)da, so that formally we can obtain Theorem 



1.1 



from Theorem 1.2 by taking A\ = 0, A 2 — oo. Note that A 9iK (a) -4 as a j or 
a j" oo. This means that too small and too large intervals make small contributions 
to M„. The unique maximum of A g K is attained at 

g-4 2 2 

a* = 2i- 2 ki- 2 (q ~ 2)i~ 2 . 

Thus, the largest contribution to M„ comes from the intervals of length (a* + 
o(l))log p n. 

1.3. The logarithmic case. We assume that Q holds and there is t* > such 
that 

(13 , m . :=m > , 
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Moreover, we assume that t* is the unique point of maximum of ip(t) = 2(p(t)/t 2 in 
the following uniform sense: for every e > 0, 



(14) 



¥>(*) 

0<t<t„-e t / 2 



and 



SUp ttttt: < ni* 

*>*,+£ 



i 2 /2 



Equivalently, ^(t) < m* for all t > 0, f ^ t*, and limsup^^ ^(i) < to*. Recall 
that a random variable Xi is lattice if there are a > 0, b £ K such that Xi € aZ + 6 
with probability 1. Otherwise, Xi is called non-lattice. 

Theorem 1.3. Let X\,X2, ... be i.i.d. random variables with EXk — 0, VarX^ — 
1. Suppose that conditions ( |13[ ) ; (14) are satisfied and the distribution of Xi 
is non-lattice. Then, for every r€l, 



(15) 



where 0* = 



(16) # 



lim 



-M^ < to* (log n + r) = expj - 6,e T |, 



Tf^rS cr* = V (**) and 



V erf TO* / 



lim — E 

B-s-oo £? 



max e 

fc=0,...,B 



t,S h -k<p(t,) 



e(o,i). 



8m, Vcr| 

More explicit expression for H* will be given in Section [478] below. We believe 
that in the lattice case the convergence in Theorem 1.3 breaks down. (A simi- 
lar phenomenon was observed in [22j for scan statistics with fixed window size). 
However, we still have tightness. 

Theorem 1.4. Let Xi,X2, ... fee i.i.d. random variables with EXk = 0, VarX^ = 
1. Assume that conditions B, (13), (14) hold. Then, the sequence of random 
variables M 2 — 2m, log n, n 6 N, is tight. 

In the next theorem we compute the contribution of different lengths to M n . Let 
ci* = l/y>(t*). We will show that only intervals whose length differs from d, logn by 
a quantity of order i/logn are relevant. Recall that M n (/ii, h-z) was defined in 

Theorem 1.5. Fix arbitrary A\ < A 2 . Define l~ = d*logn + Axy/l ogn and 
ly t = d, logn + A2\/logn. Under the same assumptions as in Theorem 1.3, for 
every r£l, 

rAi 



(17) lim P 

n— yoo 



rl 



M 2 n (l-,l+) < (logn 



exp • 



Here, : I 

are as in (|16 1. 



(0, oo) is a function given by O(a) 



2„2 



9(o)da|. 

, where j3* and 



Note that O* = Q(a)da, so that we can obtain (at least formally) Theo- 
rem 



1.3 from Theorem lL5j by taking A\ = — oo, A2 = 00. Since O(a) attains its 



maximum at a = 0, the maximal contribution to M n comes from the intervals with 
length d* logn + o(ylogn). 



1.4. The sublogarithmic case. In this case we consider random variables whose 
right tail is heavier than the standard Gaussian tail. In this case only intervals of 
length o(logn) make contribution to M„, as the next theorem shows. 
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Theorem 1.6. Let X%, X2, ■ ■ ■ be i.i.d. random variables with EXk = 0. Var Xk = 1 
and such that ip(t) = logEe*^ 1 is finite on [Q,to), for some to > 0. Assume that 
for some a < 2 we have P[Xi > x] > e~ x , for sufficiently large x. Then, for every 
a> 0, 

(18) lim P[M n = M„(l,alogn)] = 1. 

Under some regularity assumptions on the tail of X\ it is possible to show that 
only intervals of length 1 (that is, only individual observations) contribute to M n . 
In this case, the study of M„ is equivalent to the study of the maximum U„ = 
max{Xi, . . . , X n }. We assume that for some a G [1, 2) and D > 0, 

(19) lim — logP[Xi > x] = -D. 

x— x a 

Theorem 1.7. Let Xi, X 2 , ... fee i.i.d. random variables withEX^ = 0, VarXj. = 1 
and such that (19 1 holds. Then, lim„_>. 00 P[M„ = U„] = 1. 

The above results show that the square-root normalization we used in ([!]) is not 
very natural in the sublogarithmic case. See [351 1231 13H [Ml US for other types of 
normalization which can be used in the case of sublogarithmic (or even heavier) 
tails. 

1.5. Examples. In this section we show that most classical families of distributions 
considered in the probability theory belong to one of the four cases considered above. 

1.5.1. Symmetric Bernoulli. Let X\, Xi, ... be i.i.d. with symmetric Bernoulli dis- 
tribution, that is P[X k = ±1] = 1/2. We have 

i 2 f 4 

<p(t) = logcosh(i) = - - — + o(t 4 ), t -> 0. 
We are in the superlogarithmic case. Indeed, all coefficients of the Taylor series 



2 

k=2 



cosht- e f 1 = - > r 



2^" \(2k)\ 2 k k\ 



are negative. This shows that fit) < t 2 /2 for every t > 0. Since we also have 
limt-j.oo tp(t) /t 2 — 0, it follows that condition Q is fulfilled. We are in the super- 
logarithmic case with q = 4. The optimal lengths are a log 2 n, a > 0. 

1.5.2. Non-symmetric Bernoulli. Fixpg (0, 1) and consider i.i.d. Bernoulli random 
variables Y U Y 2 , ■ . ■ with F[Y k = +1] = p and F[Y k = -1] = 1 - p. Consider also 
the normalized random variables 

X k = -pT k -(2p-l)), a=^Ap{l-p). 
a 

Then, EX k — and VarXfc = 1. The log-Laplace transform of X k is given by 

(20) <p(t) = -— + log(l + pe? - p ) = L + i-^i 3 + (t 3 ). 

a 2 3<T 

The graph of the function ip(t) — 2ip(t)/t 2 in dependence on the parameter p is 
shown in Figure [2] As already shown above, for p = 1/2 we are in the superloga- 
rithmic case with 9 = 4, and the optimal lengths are a log 2 n, a > 0. 

Proposition 1.8. If p € (1/2,1), then ip(t) < t 2 /2 for all t > 0. We are in the 

superlogarithmic case with q = 3. The optimal lengths are a log 3 n, a > 0. 
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FIGURE 2. The graph of ip(t) = 2ip(t)/t 2 for the non-symmetric 
Bernoulli distributions in dependence on the parameter p. Blue 
curves correspond to p > 1/2 (superlogarithmic case), red curves 
correspond to p < 1/2 (logarithmic case). On each red curve the 
point of maximum (£*,m*) is shown. The black curve on the 
boundary between blue and red corresponds to p — 1/2 (super- 
logarithmic case). 



For p < 1/2 the coefficient of t in the Taylor series (20) is positive. This implies 



that to* = sup t>0 tjj(t) > 1. Also, it follows from (20 1 that Umt_>oo i>{t) — 0, hence 
the maximum to* is finite. From Figure [2] we see that the maximum is attained 
at a unique point. (We were not able to prove this fact rigorously. It seems that 
the proof requires tedious computations with transcendental functions). Thus, we 
should be in the logarithmic case. The optimal lengths are log n + ayTog n, 
o£R. Here, <i* = l/tp(t*), where t* is the solution of the transcendental equation 
2stp'(t*) = <p(t*). 

1.5.3. Binomial. If some distribution satisfies the superlogarithmic or the logarith- 
mic assumptions, then the same is true for its convolution powers. More precisely, 
this means the following. Let X\, X2, ... be i.i.d. random variables with distribution 
function F(z). Let Xi,X2, ... be i.i.d. random variables with distribution function 
F* m (zy/m), where to e N is fixed, and F* m denotes the m-th convolution power 
of F. Then, the log-Laplace transform of Xk is given by (p(t) = m<p{t/^/m). The 
equality 



entails that if X\ satisfies the superlogarithmic or the logarithmic conditions, then 
the same holds for the variable X\. For example, our results on the Bernoulli 
distributions imply that the binomial distribution Bin(fc,p) (after standardization) 
belongs to the superlogarithmic case for p > 1/2 and to the logarithmic case for 
p<l/2. 

1.5.4. Uniform. Let X±, X2, ... be i.i.d. random variables with uniform distribution 
on the interval [— v3, v3], so that EX^ = and VarX^ = 1. We have 

. . , sinh(v / 3i) t 2 i 4 . 
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We are in the superlogarithmic case with q = 4. To see this note that 

sinh(V3t) p t 2 /2 = vVfc/ 3 * 1 

V( 



k=1 



,(2fc + l)! 2 k kl 



All coefficients are negative, as one easily verifies by induction. The optimal lengths 
are a log 2 n, a > 0. 

1.5.5. Gamma, Negative Binomial, Poisson. The former two distributions (includ- 



ing exponential and geometric as special cases) are covered by Theorems 1.6 and 1.7 



The Poisson distribution is covered by Theorem |1.6[ Although it does not satisfy 
the assumptions of Theorem |1.7[ it is easy to check that the conclusion of this 
theorem remains valid in the Poisson case. The square root normalization in the 
definition of M„, see ([I]), is thus not natural for these distributions. See [3"§I [57] 
for alternative normalizations. 

1.6. Remarks. We sketch some possible extensions and modifications of our re- 
sults. 

1.6.1. M„ versus M 2 . In order to simplify the formulas, we stated our results for 
M 2 instead of M„. It is easy to translate everything to M n : if M 2 — a n converges 
weakly to some distribution G for some sequence a n —¥ +00, then 

(21) 2Va n (M n ~ v^T) A G. 

n— ►oo 

Here is the proof of this implication. Note that P[M„ < 0] goes to as n — > 00 since 
it can be estimated above by P[U„ < 0], where U„ = max{Ai, . . . , X n }. Hence, 
for every rel, 



M„ < 



Mi <a n + r- 



T 

4a, 



o(l) 



By our assumption, the right-hand side goes to G(r), for every r 6 E where G 



is continuous. This yields (21). Similar argumentation applies to Theorems 1.2 
and[L5l 



1.6.2. Hitting times. It is possible to state our main results, Theorems |l.l| and |1.3| 
in terms of the hitting time 

(22) T{u) = min{n € N : M„ > u}, u> 0, 

rather than in terms of M„. This approach was used in [36 . There, T(u) was 
introduced as a stopping rule for a sequential change-point detection. It turns out 
that T(u) has limiting exponential distribution, as u — > 00. The Gaussian case was 
analyzed in |36j . In the non-Gaussian case we have the following two results. 



Proposition 1.9. Let the assumptions of Theorem 
Then, for every y > 0, 



1.1 



be fulfilled. Let a - i • ^ 



2 q-1 ■ 



(23) 



lim P 



- a 2 a ^ — 



e ^ T{u) > y = exp{-A 9iK y}. 



Proposition 1.10. Let the assumptions of Theorem \1.3\ be fulfilled. For every 

y>o, 



(24) 



lim P 

u— >oo 



T(u) > y = cxp{-e*y}. 
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Proof of Propositions \n\\rW[ Fix y > Let n = 2 a U - 2a e u2/2 y (for Proposi 



tion 



With r 



(25) 



(26) 



1.9|| or n — e u /( 2m *)y (for Proposition 1.10). Note that n need not be integer. 

— logy, we have, as w, n — > oo, 

u u 

— = log([n] \og a [n]) +t + o(l), resp. — = m»(log[n] + r). 

"[M„ < 0] = 0. In the case of Proposi- 
ti. 



Recall from Section 1.6.1 that lim n _ 
tionflJl 



-Ct„,2ct r 



V T(«) >y 



\M, 



< u] 



~-Mf 



< 



o(l) 



Taking into account (25 1 and applying to the right hand side of (26) Theorem 



1.1 



we obtain that the limit of the right-hand side of (26) is e A i,kV _ Proposition 1.10 
is proven analogously. □ 

1.6.3. Two-sided version o/M„. If in the signal detection problem mentioned at 
the beginning of the paper we do not know whether the signal has positive or 
negative mean, it is natural to consider |M„| = max{M+ , } as a test statistic, 
where 

Sj Si 



M+ = M„ = max 3 i 



M~ = - 



0<i<j<n y/j — i ' " 0<i<j<n j — i 

Large values of M+ (resp., M^) indicate the presence of a signal with positive 
(resp., negative) mean. Since M~ is obtained from M n by the substitution Xk ^ 
— Xk, our results (under appropriate assumptions) yield limiting distributions for 
both M+ and M~. Moreover, M+ and M~ become asymptotically independent 
asn-> oo. We leave this fact without a proof, but note that for i.i.d. random vari- 
ables it is well-known that the maximum and the minimum become asymptotically 
independent as the sample size goes to oo. If the X^s have symmetric distribution 
and if M„ has a limiting distribution of the form exp{— be~ CT }, for some constants 
6, c > 0, the asymptotic independence implies that |M„| has limiting distribution 
of the form exp{— 2be~ CT }. For non-symmetric Xk, it is possible that M+ and 
belong to different cases. If this happens, the case with the larger normalizing 
sequence determines the behavior of |M„|. 

1.6.4. Non-unique maximum. Among the distributions satisfying Q there some 
exotic examples which are not covered by our results. For example, it is possible 
that the supremum of ijj (which is strictly larger than 1) is attained at several 
points ti, . . . ,t m > simultaneously. In this case, Theorem [T3] still holds, but the 
constant 0* in ( 15 ) has to be replaced by 0i + . . . + Q m , where the summands 0j 



correspond to the contributions of the different Vs. It is however not possible that 
the maximum of ip is attained at some interval (or some set having a limit point in 
[0, oo)). This follows from the uniqueness theorem for analytic functions. (Note that 
i\) can be extended analytically to the right half-plane). It is also possible that the 
maximum of i\) is equal to 1, but is attained at t\ = and some other point t 2 > 0. 
The first point is described by Theorem |1.1| with normalization sequence ai iM = 



log(nlog 3 9^ n), the second point is described by Theorem 1.3 with normalization 
sequence 02, n = logn. If q < 6 (which is usually the case), then a 2 , n ~ o-i,n ^ +°° 
and the contribution of t\ = is asymptotically negligible. Our results do not cover 
the situation in which tfj(t) < 1 for all t > 0, but lim^oo ip(t) = 1. It is, however, 
difficult to find a distribution with these properties. 
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1.6.5. Strong approximation. The first naive attempt to obtain the limiting distri- 
bution for M„ is to approximate the random walk S n by a Gaussian random walk 
W n using the strong invariance principle of Komlos-Major-Tusnady |10j . We will 
now explain why this approach fails. If we exclude the case in which S n is itself 
Gaussian, the best possible rate of strong approximation is \W n — S n \ = O(logn) 
a.s.; see [10]. Given < i < j < n with I — j — i we obtain for the difference 

Wj - W z S 3 - Sj 

Vj - i Vj-i 

the estimate Oilogn/ \/Z). If we want to apply this to show that the weak limit 
theorem satisfied by M„ has the same form in the Gaussian and in the non-Gaussian 
case, the approximation error should be of smaller order than the fluctuations of 
M„, which are of order 1/yTogri; see, e.g., ^. Thus, we obtain a sufficiently 
accurate strong approximation if I is of larger order than log 3 n. However, our 
results show that the behavior of M„ is determined by the intervals of length at 
most 0(log 3 n). In this domain the strong approximation is too inaccurate. The 
best one can prove using a direct strong approximation argument is the following 
result: for any sequence l n such that ^„/log 3 n — > oo but l n /n — > 0, there is a 
sequence b n not depending on the distribution of X\ such that M^(Z„,n) — b n 
converges to the Gumbel distribution. In the Gaussian case this can be proved by 
the methods of [17] . Then, the strong approximation implies that the same limit 
theorem holds for any X\ satisfying Q. 

1.7. Notation and strategy of the proof. First we fix some notation which will 
be used throughout the paper. Let {X k ,k £ Z} be non-degenerate i.i.d. random 
variables with HX k = 0, VarX/j = 1. We always assume that Q holds. Define the 
two-sided random walk {S k , k £ Z} by 

S k =X 1 + ...+X k , S o = 0, S_ k = X_ 1 + ... + X_ k , fceN. 

Consider the set I = {(i,j) E Z 2 : i < j}. Our main object of study is the 
standardized increments random field Z = {Zij : G 1} defined by 

(27) ^. = ^4 (i,i)Gl. 

Elements G I will be called intervals. Any interval €E I will be identified 
with Zij, the corresponding standardized increment, as well as with the set {i + 
1, . . . , j}. We call I := j — i e N the length of (i, j). 

We are interested in the random variable M„ = max^ ^gi^ ^i.j, where I„ C I is 
a set of intervals given by 

(28) I„ = {(i,j)eZ 2 :0<j<j<n}, n £ N. 

Clearly, M„ is a maximum of random variables, but these are neither independent, 
nor identically distributed. We prove our results by a careful extreme- value analysis 
of the field Z. Let us explain the method of the proof. We will take an appropriate 
threshold u n —¥ oo and compute the limit of the exceedance probability P[M„ > u n \. 
The first step is to compute the individual probability ¥[Zi j > u n ] for intervals (i, j) 
having "optimal length" l n . To this end, we need classical limit theorems which 
will be recalled in Section [5J The optimal length is chosen as in Section [O] The 
second step is to compute the local probability Pfmax^j)^ where T c I is a 
discrete square with side length of order q n . Here, q n is chosen to be the "extremal 
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decorrelation length" of Z. This means that the exceedances of Z at two points of I 
at distance of order q n retain non-trivial asymptotic dependence in the large n limit 
(that is, they become neither competely dependent, nor completely independent). 
There is way to characterize q n using the language of the Poisson clumping heuristic; 
see pQ. The intervals for which Z^j > u n form clumps distributed randomly 
in I. Then, the linear size of these clumps is of order q n . From this part of the 
proof it will be clear why it is not possible to choose the "optimal length" as large 
as possible: intervals with large length are strongly dependent (meaning that the 
decorrelation length is large there) and therefore make small contribution to M„. 
The third step is to compute the exceedance probability over a domain of size 
much larger than q n . Such domain can be decomposed into many small domains 
of size q n and the exceedance events over these small domains are asymptotically 
independent due to the extremal decorrelation. The asymptotic independence is 
shown by estimating the double sum appearing in the Bonferroni inequality. Thus, 
we can apply the Poisson limit theorem for weakly dependent events. 

On a rigorous level, there are several (closely related) powerful methods to an- 
alyze extremes of random fields; see [24] [32] El Q] . We use a modification of the 
double sum method of Pickands [3D]; see also Leadbetter et al. [Ml Chapter 12], 
Piterbarg .32, Chapter D]. Originally, the method was used to analyze extremes of 
Gaussian processes, but it can be applied to non-Gaussian scan statistics as well; 
see [6] |33j [31] [8] . These references deal with fixed window size, for an example with 
variable window size see [20]. A related method was also used by [37]. 

Throughout the paper C\, C<x, . . . and C\, C2, . . . are positive constants which may 
change from line to line. They may depend on the distribution of X\ and parameters 
specified in the text. We write a n ~ b n if lim^^oo a n /b n = 1. Let No = {0, 1, 2, . . .}. 



2. Results on large and moderate deviations 



In our proofs we will make a heavy use of the exact asymptotic results for 
the probabilities of large and moderate deviations of sums of i.i.d. random vari- 
ables. Recall from Q that ip is the log-Laplace transform of the X^s. Define the 
Legendre-Fenchel transform of <p: 



(29) 



/GO 



sup(s< 

t>0 



<p(t)), s>0. 



Let Soo = sup{s e M : ¥[Xi < s] < 1} > be the right endpoint of Xi.Then, I 
is a finite, strictly convex, strictly increasing, infinitely differentiable function on 
[0, Soo), and Z(s) = +oo for s > Soq. Also, we have Soo = limsup^^ <p(t)/t. 

The next theorem on the probability of "moderate" deviations proved originally 
by Cramer et al. [9] has been subsequently strengthened by Feller [11], Petrov [27] 
and Hoglund [14]; see also [29] and [15] . 

Theorem 2.1. Let Xk > be a sequence such that Xk — > oo but Xk — o(V~k) as 
k — > oo. Then, as k — > oo, 



(30) 



> x k 



1 



V2 



■KX k 



exp 



Often, it is more convenient to introduce the Cramer series A(y) = y 3 ( — 



I{y)) and state (30 1 in the following equivalent form (which is valid without the 
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requirement Xk 
(31) 



oo): 



S k 



4> 



Here, $>(x) is the tail lunction of the standard normal distribution. Relations (30) 

_J_ p -x 2 /2 



and (31 ) are equivalent since $>(x) 



/2-rx 



as x — > oo. Note that for Xk being 



constant the central limit theorem is recovered. 

The next theorem due to Bahadur and Ranga Rao [3J and Petrov 
the probabilities of "large" deviations of Sk- 



deals with 



Theorem 2.2. Assume that X\ is non-lattice. Let Xk > be a sequence such that 
Xk ~ ay/k for some a > 0, as k — > oo. Then, as k — > oo, 



(32) 



Sk 
Vk 



1 



1 



■ exp 



V2nk~r(a)o-(a) 
Here, a 2 (a) = <p"(I'(a)). 

For lattice variables the theorem should be modified; see In fact, Theo- 

rems 2.1 and 2.2 can be included as special cases in a general result; see [14]. We 
will need just the following inequality; see [27], [14]. It is valid both for "moderate" 
and "large" deviations, both in the lattice and in the non-lattice case. 

Theorem 2.3. For every A £ (0, Sqo) there is a constant C — C(A) such that for 
allkeN, xe (0,AVk), 



(33) 



rS fe 



> X 



< 



c 



-»{- w (£)}- 



The next lemma is elementary and well-known. It is weaker than Theorem 2.3 
but valid without restriction on x. Instead of Q we assume that ip(t) = log Ee tAl 
is finite on [0, too), for some too > 0. 

Lemma 2.4. For every k S N and x > 0, we have 

S k 



Vk 



> X 



- exp {- fc/ (^)} 



Proof. By Markov's inequality F[S k > Vkx] < e -tVkx+k<p(t) ^ for every f > q Take 
the minimum over t > 0. □ 



3. Proof in the superlogarithmic case 

In this section we prove Theorem |1.1| and Theorem |1.2| It will be convenient 
to pass from conditions ([7| and Q to their Legendre-Fenchel conjugates. We will 
assume that for every e > 0, 



(34) 



inf 44 > 1. 



5 > £ s 2 /2 

We also need the Taylor expansion of / at 0: with k > 0, 



(35) 



I(s) = -+ns q + o(s ( >), s±0. 



Proposition 3.1. Assume that (|4|) holds. Then, conditions ^ and (34| are equiv- 
alent. Also, conditions ^ and (35) are equivalent. 
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Proof. Assume that Q holds. Let e > be arbitrary. By ([7| we can find c 
c(e) < 1 such that ip(t) < ct 2 /2 for all t > e. It follows that for every s > e, 



I(s) = sxvp(st — (p(t)) > swp(st — (p(t)) > sup I st — ) 

t>0 t>e t>E ^ 2 / 



s 

2c' 



^- is attained at 



Note that the last equality holds since the supremum of st 
t = ~ > e. It follows that (34 1 holds. The proof that p4| implies Q is analogous 
Assume now that pi holds. Note that ip is analytic in a neighborhood of zero. 
Taking the derivative, we obtain (f'(t) = t — qnf 1 ^ 1 + o(t 9_1 ). Now, by Legendre- 
Fenchel duality, I' is the inverse function of if' . Taking the inverse function of ip' 

« g ~ 1 + o(s 9_1 ). Integrating, we obtain (35). The proof of 



we obtain I'(s) = 

the converse implication is analogous. □ 

Recall the notation p = . Fix some r G R and define a normalizing sequence 
u n = u n (r) > by 



(36) 



21og(nlog 2 p n) + 2r. 



Throughout the remainder of Section[3]we assume that conditions ( 34 1 and ( 35 ) are 
satisfied. Sections |3.1f)3.4| are devoted to the proof of Theorem |1.2| In Section [3~5| 
we complete the proof of Theorem 

3.1. Individual probability. The first step is to compute the probability that 
the random variable 7L%j exceeds some large threshold at some individual point 
€ I. We will consider intervals whose length l n is optimal, that is 

l n ~ a log p n, for some a > 0. 

Lemma 3.2. Let l n £Nfea sequence such that a := lim„_ i . 00 l n / log p n € (0,oo). 
Let s £ K be fixed. Then, as n — » oo. 



P n (s) :=P 

Proof. Note that u r , 
Theorem [2~T1 



Si 



s 

u„ 



1 



„s-k2 2 a 



n log 2 p n 



(37) 



Pn(s) 



o(v^n) since u 
1 



\J1 log n by (36) and p > 1. By 



2ttu, 



exp 



We evaluate the term under the sign of the exponential. Since u r , 
l n ~ alog p n, and p = ^32, we have 

s 

= 2^a-^l- 1 +o(l- 1 ). 



y/2 logn, 



Using the Taylor expansion in (35), we obtain 



Inserting this into (37) yields 
(38) 



s 

U,, 



+ K2?a 



o(l). 



Pn(s) 



2ttu, 



exp 



To complete the proof of Lemma 3.2 note that e 11 ™/ 2 = e T n\og 2 p n and 
y/2 log n by po). 



□ 
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3.2. Local probability. The next step is to compute the exceedance probability 
over a small discrete square in the space of intervals. Given an interval (x, y) G I 
of length I :— y — x and a "length fluctuation" r £ N, let T r (x, y) be the set of all 
intervals £ I satisfying 

x — r < i < x and y < j < y + r. 

Note that all intervals from the set T r (x,y) are extensions of the "base" interval 
(x, y). We can view T r (x,y) as a discrete square with side length r in the grid 
I C Z 2 . The base interval (x,y) corresponds to the right bottom vertex of this 
square. The cardinality of T r (x, y) is r 2 . Fix any sequence q n € N satisfying 



q n ~ log'' * n as n oo. Note that q n — > oo since p > 1. 

Lemma 3.3. Let i n eNfea sequence such that a := lim„_ i . 00 l n / log p n £ (0, oo) 
fia: L? G N and write T„ = Ts q „(x,x + £ n ). As n — > oo, 



(39) 



max 

(i,j)eT, 



P„(0).{l + ff =(f)}, 



where P n (0) is as in Lemma 3.2, the junction H : (0, oo) — » (0, oo) is defined by 



(40) 



=E 



sup e 
te[o,B 



V2W(t)-t 



5 >0, 



where {W(t),t > 0} is a standard Brownian motion. 

Proof. The idea of the proof is to represent the field Zij as its value at the base 
interval (x,x + l n ) plus some incremental process. In order to exceed the level u n 
over T n either the value Z x ^ x ^.[ n at the base interval should be larger than u„, 
or this value should be of the form u n — u^s, s > 0, and the supremum of the 
incremental process should be larger than u^s. We will show that the incremental 
process converges to the sum of two independent Brownian motions. 

Let x9~\ xf , k G N, be i.i.d. random variables with the same distribution as 
the Xfc's, and which are independent of the X^s. Define two independent random 
walks 



(41) 



(0 



X 



k£ N , i = 1,2. 

Let V n be a random variable defined by Z XiX+ i n = u n — u^Vn. Then, since any 
interval from T„ has the form (x — k\, x + l n + k%) with some integers < hi, k,2 < 
Bq n , we have 



max 

0<k 1 ,k 2 <Bq„ 



> U r , 



y/l n + ki+k 2 

By taking k\ = k% = we see that the maximum on the right-hand side exceeds 
u n if V n < 0. Note that P[V n < 0] = P n (0), the probability which was evaluated 
in Lemma 



3.2 



The random variables V n , are independent. Conditioning 



on V n = s and integrating over s > 0, we obtain 



(42) 



Qn = Pn(0) 



G n (s)dn n (s), 



where fi n is the probability distribution of V n and G n is a non-increasing function 
defined by 



G n (s) 



max 

0<fci,fe 2 <-B9„ 



S 



(i) 



K2 



In) 



> 
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By Lemma [3T2| for every s > 0, 

(43) lim MMl = lim 5^ = e s, 

K ' n->oo P„(0) n-Xx> P n (0) 

Let s n be any sequence converging to s > 0. We will compute linin^oo G n (s n ). We 
have 

(44) G n (s„)=pf sup {W^(t 1 )+W^{t a )-fn{ti,t 2 ))>0 

where {w£\t),t e [0,B]}, i = 1,2, are stochastic processes and /„ : [0,-B] z 
is a function given by 

W^'{t) = ——, Jn{ti,t 2 ) 



if t, ti,t 2 G g^Zfl [0, B), and by linear interpolation otherwise. On the last interval 
of length q~ x we agree to use constant interpolation. Recall that u n ~ -\/21ogn, 
l n ~ alog p n, g„ ~ log p ~ n, as n — > 00. Elementary calculus shows that uniformly 
in h,t 2 G [0,B], 

fn(ti,h) f(ti,t 2 ) := - tj- 2 + J^s as n -t 00. 

V2a V * 

Given a compact metric space K let C(K) be the space of continuous functions 
on K endowed with the sup-metric. By Donsker's invariance principle, as n — > 00, 
the processes {Wn\ti),ti £ [0, B]}, £=1,2, converge weakly on the space C[0, B] 
to two independent standard Brownian motions {Wi(t{),ti € [0, -B]}, i = 1,2. 
Consider the map 

* : C[0, B] x C[0, B] x C([0, B] 2 ) -> K 

defined by 

*(iui,ix?2,/) = sup (wi(ti) + ui 2 (< 2 ) - f(h,t 2 )). 
ti,i 2 e[o,s] 

The map VP is continuous in the product topology. By the continuous mapping 
theorem, see Theorem 3.27 in [2T], it follows that the random variable 

*(Wi 1) M 2 \f n )= sup (W«(*i) + ^ 2) (i 2 ) -/„(*!, * 2 )) 
ti,t a e[o,B] 

converges in distribution to the random variable 

*(Wi,W a ,/)= sup (Wi(ti) + W a (ia)- \/~» 
h,t a e\o,B] v V2a V ^ 



ti,t 2 e[o,B] 

By the scaling property of the Brownian motion, the latter variable has the same 
distribution as */§ (Mi + M 2 — s), where M\,M 2 are two independent copies of the 
random variable 

M := sup (V2W(t) -t)>0. 

t£[0,a- 1 B] 

Here, {W(t), t > 0} is a standard Brownian motion. Note that the random variable 
M (and hence, M\ + M 2 ) has continuous distribution function. It follows that for 
every sequence s n converging to s > 0, 

(45) lim G n (s n ) = P[Mi + M 2 > s]. 
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Taking (43 1 and (45) together, we obtain, formally, 
,d/j, n (s) 



(46) lim 



G n (s) 



Pn(0) 



POD 

/ P[Mi + M 2 > s]e s ds = Ee Ml+Ma 
Jo 



(Ee 



M\2 



Recalling ( 42 ) we obtain the statement of the lemma. The first equality in ( 46 ) will 
be justified in Remark |3.7| after some technical preparations have been done. □ 



The next lemma gives a somewhat weaker statement than Lemma |3.3[ but this 
statement is valid under more general assumptions. The lemma will be used later 
to estimate the exceedance probability over the non-optimal lengths. Essentially, 
it states that the local exceedance probability can be estimated by the individual 
exceedance probability times some constant. 

Lemma 3.4. Fix constants B\, B 2 > 0. Then, for all x € Z, l,reN and all u > 
such that B\l > u 2 and r < B 2 lu~ 2 , we have 

,2 



Q{l,r, u) 



S j Si 

max — — ^ — 



> u 



< Ciu 



_1 exp{ 



u 

y 



c 2 u q r 



where the constants C\ and C 2 depend on B\ and B 2 but don't depend on x,l,r,u. 



Proof. Define two independent random walks Sj: \ i = 1,2 as in (|41j). Let Vi iU be 
a random variable defined by Z x ,sc+l = u — u~ l Vi^ u . Then, since any interval from 
T r (x, x + I) has the form {x — k\, x + I + k 2 ) with some integers < k\, k 2 < r, we 
have 



Q(l,r,u) = 



max 

0<fei,fc 2 <r 



Taking ki, k 2 = we see that the maximum on the right-hand side is non-negative. 
Conditioning on Vi jU = s and considering the cases s < and s > separately, we 
obtain 



(47) 



Q(l,r,u) =Fj,„(0) 



Gi, rtU (s)dFi jU (s), 



where Fi u is the distribution function of Vi u and 



(48) Gi <r>u (s) 



max 

0<fci,fc 2 <r 



S 



(i) 
fei 



> 



< 2P 



max b k > -7— 

0<fc<r ZU 



We estimate F^ u (s) for s £ [0, |u 2 ]. Write v = u — -. By the assumption v < \/B%l 
we can apply Theorem 2.3 and (34), (35) to get 



Fi lU (s) =P 



Vi 



> V 



-l r v 2 „, 1-21 

< civ exp | - y - c&w'Z 2 |. 



Here, the constants c\,c 2 ,... depend on B±, B 2 but don't depend on x,l,r,u. If 



s E [0, §w 2 ], then t> > it/4 and we obtain 



(49) 



2 

Fl,u( s ) ^ c 3 u~V exp | - — - c 4 u 9 r ^ j 



It is however easy to see that this inequality continues to hold for s > 3u 2 /4. Indeed, 
if C4 is sufficiently small, then the assumption B\l > u 2 implies that C4ii q l~( q ~ 2 ^ 2 < 
u 2 /8. Hence, is C3 is sufficiently large, the right-hand side of (49) is greater than 1 
and J49l holds. 
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We estimate Gi iTtU {s) for s > 0. Applying to the right-hand side of (48) the 
inequality stated in Theorem 2.4 on p. 52 in [25], we obtain 



Gl,r,u(s) < 



2u 



< 4P 



S <■ 



> c h s - V2 



< 4e 



In the second inequality, we used the assumption r < B 2 lu 2 . In the third inequal- 
ity we used Lemma 2.4 From (34) we obtain 

(50) G*,r,«(s) <c 6 e" C7s2 . 

Strictly speaking, this is valid only as long as c§s > \/2, however, we can choose 
the constant cq so large that (50) continues to hold in the case c$s < v2. It follows 
from d47b, ([49|, |50| that 



Q(l,r,u) < F liU (0) 
<c 8 (l + 



OO 

E 

k=0 



G l>r , u {k)F liU {k + l) 



fc=0 



_1 exp | 



C 4 U 9 / 2 

2 



}■ 



The proof of Lemma |3.4| is complete. 



□ 



Lemma 3.5. Let ^, z/ n , ueN, be measures on [0, oo) which are finite on compact 
intervals. Let G, G n , n € N, be measurable functions on [0, oo) which are uniformly 
bounded on compact intervals. Assume that 

(1) v n converges to v weakly on every interval [0,t], t > 0; 

(2) for v-a.e. s > and for every sequence s n — > s we have lim„_i. 00 G„(s„) = 
G(s); 

(3) limx_j. +00 J T \G n \dv n — uniformly over n 6 N. 
TTien, lim^oo / °° G„df„ = J °° Gdi/. 

Proof. Write F n (s) = ^„([0, s]) and -F'(s) = ^([0, s]), s > 0. By the first assumption, 
limn^oo F n (T) = F(T) for all T > 0, T £ V, where V is the set of (at most 
countable) discontinuities of F. By the third assumption it suffices to show that 
for every T > 0, T £ X>, 



(51) 



lim 



G n dv n = I Gdv 



Fix some T T>. By the first assumption and by Skorokhod's representation 
theorem we can construct (generally, dependent) [0, T]-valued random variables 
and £ on a common probability space such that — > £ a.s., P[£„ e B] = 
v n (B)/F n (T) and P[£ € B] = v(B)/F{T), for every Borel set B C [0, T]. It follows 
from the second condition that G„(£ n ) — > G(£) a.s. Since for uniformly bounded 
random variables the a.s. convergence implies the convergence of expectations, we 
obtain (toTl). □ 



Remark 3.6. We will verify the last condition of Lemma 3.5 by showing that 
v n {[0,s\) < ae C2S and \G n (s)\ < cge^ 4 * 2 for all (large) n G N. Then, 



/ \G n \du n < J2 u n ([k,k + l}) sup |G n (s)|< 

JT k=[T] se[k,k+l] 

which converges to uniformly in n £ N, as T — > +oo. 



oo 

c 5 e C2k e~ Cik2 
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Remark 3.7. We are in position to justify the first equality in (46 1. The first two 
assumptions of Lemma 3.5 with v n — /i„/P n (0), are fulfilled by (43 1 and (45). To 
verify the last assumption we use Remark |3.6| For s > we have, as established 



in (49) and Lemma 3.2 



~ x e s exp • 



2 



c</„" 2 } < c ie s 



For G n we obtain from (50) the estimate G n {s) < Gi„ t q n ,v, n ( s ) < c 2& C3S ■ Now, 
Remark 3.6 can be applied. 

3.3. Estimating the double sum. Given < A\ < A 2 we define l~ = A\ \og p n 
and Z+ = A 2 \og p n. 

Lemma 3.8. Let w n be any integer sequence such that w n /q n — > oo but w n — 
0(\og p n), as n — > oo. For z £ Z let $ n (z) be the set of all intervals £ I such 
that z < i < z + w n and j ~ i £ [l~ , Z+] . Then, as n — > oo, 



(52) 



where A(a) = A 9iK (a) 



max 



- T W n 



A(a)da, 



{i,j)es„(z) - J n JAl 

2 ip 2 exp{— k2% a~ } , a > 0, is as in (12) . 



Proof. By translation invariance we may take z = 0. Take some B £ N and recall 
that g n € N is a sequence satisfying g„ ~ log p_1 n. To get rid of the boundary effects 
we introduce two sequences e n and S n such that e n /q n — > oo and 5 n /q n — > oo, but 
e n = o(w n ) and S n = o(log p n), as n — > oo. Introduce the following two-dimensional 
discrete grids with mesh size q n : 



(53) J n {B) = {(x, y) £ Bq n Z 2 : x £ [-e n ,w n + e n ], y - x £ [l n - S n , 1+ + S n ]}, 



(54) J'JB) = {(x, y) £ Bq n Z 2 : x £ [e n> 



£ n ], y-x£ [l n + 5 n ,l+ - S n }}. 



Note that J' n {B) c J n {B). The discrete squares {Ts qn {%> y)}(x,y)ej„(B) (which 
were defined in Section [3.2| ) are disjoint and cover the set J n (0). Similarly, the 
discrete squares {^Bq n (x, y)}(x, y )ej' (b) are disjoint and contained in J„(0). By 
the Bonfcrroni inequality, we have, for every B £ N, 



(55) 

where 
(56) 

(57) 

(58) 



S' n (B) - SZ(B) < 



Sn(B) = 1 

(x,y)£j n (B) 

S' n {B) = J2 1 



S'^{B) = 



max Zi j > u r 
L(i,i)eJ„(o) ' J 



< S n (B), 



max Zij > u Tl 

(i,i)£T B ,„(i,i/) 



max Zij > u r , 

(i,j)e^B q „(.x,y) 



max Zi j > u n , max Zi j > u r 

(iJ)£lB<i n (zi,yi) ' (!j)6TB,„(l2,!/2) 



and in (58) the sum is taken over all pairs (xi,yi) £ J^ r [(B) and (xiiVi) £ i7 r '(£>) 



such that (xi, y\) =^ (x 2 , y 2 ). The statement of Lem ma|3 . 8| follows by letting n — > oo 
and then B — > oo in (55 ) and applying Lemmas 3.9[ 3.10 3.16 which we will prove 
below. □ 
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Lemma 3.9. Let S n (B) be defined as in (56). We have 



(59) 



lim lim sup nw n 1 S n (B) < e T / A(a)da. 

B->oo n-j.00 



Proof. Since the probability in the right-hand side of ( 56 ) depends only on I := y— x 
by translation invariance, we have 



S n (B) < 



w n +o(w n ) 
Bq n 



iec„{B) 



max Zi a > u r 

(i,i)6T B ,„(0,i) ' J 



where C n (B) = Bq n Zf~\[l n — S n , l^+5 n ]. The idea is now to apply to each probability 
Lemma |3.3| and replace Riemann sums by integrals. Introduce the function 



A„,s(a) = nlog 2 p n ■ '. 



max Zi j > u r 

(*.j)6TB 9 „(0,i„,B(o)) 



a > 0, 



where i n ,s(a) = max{7 G Bq n 7L : I < alog p n}. The function A„s(a) is locally 
constant and its constancy intervals have length Bq n / log p n ~ B/logn. It follows 
that 



(60) 



S„(B) < 



m„ + o(w„) 
£ 2 n 



logP i 
A l logP n 



X n>B (a)da. 



For every fixed a > 0, the sequence £„ = l n , B (a) satisfies the assumption of 
Lemma |3.3| Hence, by Lemmas |3.3| and|3.2| we have the pointwise convergence 



(61) lim \ nB (a)=e T A B (a), A B (a) 



1 



2 _ 4zi2 
-K2 2a 2 



2^ 



Also, by Lemma 3.4 X n>B (a) is bounded by a constant not depending on a, n, 
as long as a stays bounded from and oo. Applying the dominated convergence 
theorem to (60) we obtain 

(62) limsup my" 1 ^^) < e~ T / B- 2 A B {a)da. 

n— >oo J Ax 



Now we let B — > oo. Recall that H is a function defined by (40). It is known 
that lims^oo H(B)/B = 1; see [H P- 72] or [24, p. 232]. It follows from ^ that 
uniformly in a G [A\ , .Aj] , we have 



lim B~ 2 A B (a) 



q q-2 
-K22a 2 



2y/ira 2 



A(a). 



To complete the proof let B — > oo in (62 ). 



□ 



Lemma 3.10. Let S' n (B) be defined as in (57). We have 
(63) lim limmf nw^S'JB) > e~ T / A(a)da. 

B— foo n— 7-oo ' ' J A\ 

Proof. Analogous to the proof of Lemma |3.9| 



□ 



Remark 3.11. It follows from (59) and (63) that in both equations we can replace 
inequality by equality. 
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The next lemma is needed to estimate the "double sum" S'^(B). It states that 
the exceedance events over different intervals become asymptotically independent 
with exponential decorrelation speed as the symmetric difference of the intervals 
gets larger. Consider two intervals K\ — (ii,ji) G I and K2 — (12,32) G I with 
lengths k\ := j\ — i\ and ■= ji — *2 satisfying ki,k2 G [^,^] and such that 
*ii*2)ii)i2 G q„Z. Let fc G No be the length of the intersection K ;= K\ (1 K%. 
(More precisely, K is the intersection of the sets {zi + 1, . . . ,ji} and {i 2 + 1, . . . , .72})- 
Assume without restriction of generality that k\ < k2 and write A = A(Ki, K2) — 
k% — k. In some sense, A measures the distance between the intervals K\ and K2- 

Lemma 3.12. Given an interval Kq = (io>jo) G I define a random event 
F n (io,jo) = \ max Z t] > u n \. 

There exist constants C\,C 2 > (depending on A\,Ai but not depending on K\, 
K2, n) such that for every Ki,K 2 as above, 



P n (K 1 ,K 2 ) :=P[f„(ifi)nF„(if2)] < Cxu^e 



ul/2 e -C 2 A(K 1 ,K 2 )/q n 



3.3 



Remark 3.13. Note that F[F n (K e )} ~ const • u^e - "™/ 2 , e = 1,2, by Lemma 
and (38). The factor e~ C2A ^ Kl ' K2 ^ qn provides an estimate for the dependence 
between the events F n (K\) and F n (K2). 

Proof of Lemma \3.1H% Given a finite set I C Z let Sj = J2 me j X m . Any interval 
(*0;io) G I will be identified with the finite set {ig + 1, . . . , jo}. In particular, we 
need the random variables Sk 1 = Sj t — Si 1 and Sk 2 — Sj 2 — Si 2 . Introduce the 
random variables 

D i = n3^ ax ^ ~ ^i-fe). D t = max (S J1+k - S h ), 

D% = max (S i2 - S i2 - k ), Dt = max (S j2+k - S 32 ). 

0<k<q n 0<k<q n 

These random variables are corrections appearing when wc extend the base intervals 
K\ and K2 by small intervals of length at most q n . With this notation we have an 
inclusion of events 



F n {K e ) c {S Ke + D+ + D~ > Vk £ u n }, e = 1, 2. 

Denote by K\ = (i% — q n ,ji + q n ) the extended version of the interval K\. Note 
that the interval K\ contains all intervals from T 9ii (ii, ji). Let K — K 2 PiKi be the 
extended intersection of Ki and K 2 , and denote its length by k = \K\ < k + 2q n . 
Let A = k 2 - k be the length of K 2 \K = K 2 \Ki- Fix e > 0. Introduce the 
following random events 

E 1 = [% + D2 + D+ > \Tku n + 2^2q~ n + e Av / '" " 1 

{S Kl +D+ > V^Un}, 



E 2 



Ay/k 2 + 2q n 



{Sk 2 - S R > (\fk~2- \fk)u n - 2^[2q~ n - e- 

< q n u n > 

Note that F n (K t ) C E 2 . We have F n (K 1 )nF 2 (K 2 ) C E 1 U (E 2 (~)E 3 ). By construc- 
tion, the events E 2 and E 3 are independent. We will estimate the probabilities 
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of Ei, E 2 , E3. Bringing these estimates together will complete the proof of the 
lemma. First we estimate the probability of E 2 . By Lemma |3.4[ 



(64) 



2 = 



max 



Si Si 



We now estimate P[£y. Consider the case A > 10y/Aq n . Since k < k 2 < Alog p n, 
u n ~ \/2 logn, q n ~ log p ~ n, we can choose e > so small that the following 
inequality is valid: 



k)u„ 



2q n 



u„A 



2J2q rl 



Ay/k 2 + 2q n ^ eA 



> 



2.4 



qn u n yka + V k q-nUn 

Note that A < 3 A since A < A + 2q n and A > q n . With Lemma 
follows that 

zA 

\ 1 1 I !• < ci exp -j "• , 

fqn' > *- q n > 



'Qn 



and (34| it 



(65) 



P[£ 3 ] < 



5 A > 



c 2 A^i 



In the case A < 10 V Aq n we can use the trivial estimate P[£y < 1 and (65) remains 
valid provided that c\ is sufficiently large. 

We estimate P[i?i]. First we have to get rid of the correction terms D 2 and D 2 . 
By the inequality stated in Theorem 2.4 on p. 52 in for every ieKwe have 

F[Df >t]< 2P[S qn > t 



'2q n \. 

If X is any random variable which is independent of D 2 and has distribution 
function F, then 



F[X + D+ >t}= [ P[D+ > t - s]dF(s) 
Jr 



< 2 / F[Sg n > t - 



= 2F[X + S Qn >< - V2fo]. 



Applying this trick twice to D 2 and D 2 we obtain 
P[£i] 



< 4P 



£>k+2q n 



> 



"VkUn 



+ e 



AVfc 2 + 2q n 



< 



r £>k+2q„ 

\/k + 2q r 



>u n +e- 



qn^n 



qn^n 

where in the second inequality we used the relations q n ~ log p_1 n, u n ~ ^/21ogn, 
q n < k < k 2 . (The case k = can be excluded since Lemma 3.16 holds trivially 



in this case due to the independence of F n (Ki) and F n (K 2 )). Applying to the 



right-hand side Theorem 2.3 and then (34 1, we obtain 

A 



(66) 



q n u, 



■)} 



< C lU - L e 



-ul/2 p -C 2 A/q n 



The proof of the lemma is completed by recalling that F n (Ki) PI F n (K 2 ) C E\ U 
(i?2 H S3), where #2 and -E3 are independent, and applying (64), (65), (66). □ 

Given BeN consider a discrete d-dimensional cube Qb = {0, . . . , B — l} d . We 
can decompose the lattice 1 d into disjoint discrete cubes of the form Bu + Qb, 
u E Z d . Given vi,v 2 € Z d we write vi ~ «2 if vi and U2 are in the same cube in 
this decomposition, that is if there is u £ Z d such that v±,v 2 G -Bu + Qb- Even 
though this is not explicit in our notation, the relation ~ depends on B. Given a 
set V C Z d let <9 B V be the -B-boundary of V defined as the union of discrete cubes 
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Bu + Qb, u G Z d , which have a non-empty intersection with both V and TL d \V . 
Let || • || be the sup-norm on K d and |V| the cardinality of a finite set V. 

Lemma 3.14. Let V„ C 1 d be a sequence of finite sets such that for every B G N, 
|9sV n | = o(|V„|) 7 as n — > oo. Then, for every e > 0, 

1 



lim lim sup -. 

B-yoo n -yoo |V„ 



E 



,- J I'l-f! — 



0. 



i)i,i>2£v, 

Ul 



Proof. Take any sequence /cb € N such that fc^ — > 00 but = o(B) as B — > 00. 
We have a disjoint decomposition of the cube Qb into the "kernel" Q' B and the 
"shell" Q" B defined by 

Q^ = {fc B ,...,S-fc B } d , Q b = Qb\Q'b- 
We have also a disjoint decomposition Z d = Z' B UZ B , where Z' B = U ue zd(Bu + Q' B ) 

an.i /:;, /.),. 

Consider any v\ G Z' B . Then, for every V2 G Z rf such that v\ & V2 we have 
IK'2 — Vi|| > k(B). Hence, 



(67) 



E 



e - e ||ui-« 2 || < ^ e - 

«2 6V„:i>i:X<l>2 V-2&l, d :\\v2\\>kB 
7 II 



Consider any v\ G Z B . There is a constant C < 00 (not depending on v±) such that 



E 



< 



^2 :di q^^2 



E 



E 

v 2 ez d 



e -e||Di-t)2|| _ ^ e -e||D2|| _ (J 



It follows from |5 B V n | = o(|V„|) that nV n |/|V n | -> 1, |Z£nV„|/|V„| -> 0, 
and |V n | — > 00, as n — > 00. Using (67) and (68) we obtain 



lim sup 



1 



E 



= C B . 



»i,"2£V, 

V\ 0OV2 



To complete the proof note that lims-j-oo C_b = by (67). 
Introduce the finite set 



□ 



(69) 



n 



{(*,v) 



e 



X G 





, y-xe 


l n 11 


L g« - 




-In In- 



Lemma 3.15. There is c> smc/i that for allvi, vi G V n we /ia?;e A := A(i>i, w 2 ) > 
c||t>i — U2II, where A(ui,u 2 ) was introduced before Lemma 3.12 

Proof. Let v = (11,2/1), v 2 = (2:2,2/2), where xi < y% and x 2 < y 2 . Then, ||ui— u 2 || = 
max (| 2; 1 — X21, 1 2/x — 3/2 1 ) ■ Without restriction of generality, let X\ < If £2 < yi < 
7/2, then A = u 2 || by definition. If x 2 < 2/2 < 2/1, then A = |x2— ^i| + |j/2— 2/i| > 
II - wall- Finally, if y 1 < x 2 , then A = max(i/i - x Xl y 2 - x 2 ) > l~/q n > C\ logn 
by definition of V„ and \v\ — w 2 | < w n /q n + l^/q n < C2 logn. □ 

Lemma 3.16. Let S'^{B) be defined as in ([58]). We have 



lim limsupnii; n S n [B) — 0. 

B->oo n-j.00 
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Proof. Take first some fixed B € N. Given a vector v — (x,y) £ K 2 we write 
v' = (x + 1,2/). Let (a;,y) G Bq n 7* 2 , x < y, be some interval. Then, we can 
represent the discrete square T Bq n (x,y) as a disjoint union of B 2 squares of the 
form Tg n (q n v'), where 



v £ v + {0, . . . , B - l} 2 



Bq n y_ 



£ Bl?. 



We can estimate the exceedance probability over T s qn {x,y) by the sum of the 
exceedance probabilities over the Tg„'s. It follows from (581 that 



Vl,V2 



-(M)6T,„(g„«i) 



where the sum is taken over all V\ £ Y n and v 2 £ V n such that v± no v 2 . Applying 
Lemma 3.12 and noting that A.(q n v[,q n v' 2 ) = q n A(vi,V2) we get 



C3 



«1 ,"2 



dog 2 - p i 



e — GiH-ui— t!a|| 



Here, || • || denotes the sup-norm and the second inequality follows from (36 1 and 
Lemma 3.15 Applying to the right-hand side Lemma 3.14 and noting that |V n | < 
Cw n log ~ p n we arrive at the required statement. □ 



3.4. Global probability. Recall that for < A\ < A 2 we define l~ = A\ log p n 
and = A 2 log p n. Denote by I n (Ai,A 2 ) the set of all intervals £ I„ with 

length / := j — i £ Our aim is to prove Theorem 1.2 which states that 



(70) 



lim P 

n— >-oo 



max Zi j < u r 

(iJ)6l„(A 1 ,A 2 ) ,J 



exp 



| ~ e ~ T J 2 A(a)da}. 



We will decompose the set I r! ,(v4i, A 2 ) into sets of the form J„(z); see Lemma 3.8 



Let w n = [3A 2 log p n] . To get rid of the boundary effects choose a sequence r\ n > 
such that rj n = o(n) but rj n /w n — > oo. Consider the one-dimensional grids 

fc'n = hVn, n + T) n ] n w n Z, Vj' n = [rj n , n - n io„Z. 

Note that the sets J ra (z), where z £ 7Z' n , are disjoint and cover l n (Ai,A 2 ). Simi- 
larly, the sets I n (z), where z £ TZ'^, are disjoint and contained in I n (Ai, A 2 ). The 
exceedance probability over each S n (z) satisfies, by Lemma 3. 



(71) 



max Zi a > u ri 

(»,j)6Jn(*) 



■Wn 

n 



h(a)da, n 



oo. 



Also, \R' n \ ~ |7£^| ~ n/w„ as n — > oo. If the exceedance events over J„(z) were 
independent, the Poisson limit theorem would immediately yield (70). However, the 
events are dependent. In fact, the dependence is quite weak: the exceedance event 
over J„(z) depends only on the two neighboring exceedance events over J n (z± w n ), 
if n is large. To justify the use of the Poisson limit theorem for such finite-range 
dependent events we need to check that (see, e.g., |H Thm. 1]) 



(72) 



max Zi j > u„ 
0',j)eJn(0) ,J 



(i,j)eJ„(™„) 



ZiJ > W77 



(Wn 
O — 
V n 
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Since we can apply Lemma 3.8 to the set J„(0) U J„(w n ) (replacing w n by 2w n ), 
we have 

fA 2 



(73) 



max 



. 2w r , 
n 



A(a)da, n — > oo. 



Combining (73) and (71) we obtain the required relation <|72J) . Thus, the use of the 
Poisson limit theorem is justified. The proof of (70) is complete. 

3.5. Non-optimal lengths. We will now estimate the exceedance probability over 
all intervals (i, j) £ I„ which are non-optimal in the sense that their length I := j —i 
is not between l~ = A^ 1 log p n and i+ = A\og p n, where A > is large. Denoting 
by M n (A) the set of all such intervals, we will show that 



lim limsupl 

A >OQ n ^QQ 



max Zi j > u r 



0. 



This means that the contribution of M n (A) to M„ becomes negligible as A — » oo. 
Combining this with Theorem 1 1 . 2 1 proved above, we obtain Theorem |1.1| We will 
decompose the set M n (A) into three subsets B'„, B", B"' and estimate the exceedance 
probabilities over these sets in the next three lemmas. We start by considering very 
small intervals. 

Lemma 3.17. Fix an arbitrary a > 0. Let M' n be the set of all intervals (i,j) £ I n 
whose length I = j — i satisfies I < alogn. Then, 



lim P 



max Zi ,• > u ri 



= 0. 



Proof. Let i 6 N be such that / < alogn. Recall from (36) that u n ~ v^logn, 
as n — > oo. Then, u n j\fl > l/y/a, for all large n. Consequently, by ( |34| , there is 
5 > such that for all large n and all I < alogn, 

log n 



I 



By Lemma 2.4 the exceedance probability for every individual interval from 
satisfies 



Si_ 
Vl 



< 



exp 



Since the number of intervals in B^ is at most an log n, we obtain the statement of 
the lemma. □ 

Lemma 3.18. Fix any a > 1. Le£B" = B"(yl) be the set of all intervals (i,j) £ I n 
whose length I = j — i satisfies alogn < I < A~ x log p n. Then, 



lim limsupl 



max Zi j > u, 
(i,j)eB»CA) ,J 



= 0. 



Proof. For k £ No let M" k be the set of all intervals (i,j) £ I n whose length 
I satisfies 2" (fe+1) log p n < I < 2~ fe log p n. We can cover the set B^ fe C 1? by 
disjoint discrete squares T Tnk (x, x + I), see Section 3.2 with side length r n ± '■— 
[2~ k log p ~ 1 n] , at least as long as 2 k < log p_1 n. The number of squares we need is at 
most ci2 fe (log 2_p n)n. The exceedance probability over any such square T Vn k (x, x+ 

with r 



I) can be estimated by Lemma 3.4 

„,2 



r n ,k, u 



c 2 u n exp • 



| < c 2 u n 1 exp • 



and is at most 

,2 



- c 4 2 
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Here, C2,.. • ,04 do not depend on n, k. We can cover the set M'^(A) by the sets 
!>« k , where k is such that A < 2 k < a -1 log p_1 n. For the exceedance probability 
over the set B"(A) we obtain the estimate 



max Zi j > Un 



< c 5 (log 2 - p n)^-^^-""/ 2 



E 

fc=[log 2 A] 



2 k e 



< c 6 ^ 2 k e~^ 2 . 

fe=[log 2 A] 

In the last inequality we have used that u„e u ™/ 2 > C7(log 2_p n)n by (36). To 
complete the proof note that the right-hand side tends to as A — »• 00. □ 

Lemma 3.19. Let B"' = M'^(A) be the set of all intervals € I n whose length 
I = j — i satisfies I > A log p n. Then, 



lim limsupl 



max Zi j > u r 
(i,i)£B»'(A) ' J 



0. 



Proof. For A: € No consider the set B"' fe of all intervals (i,j) € I„ with length Z 
satisfying 2 k log p n < I < 2 k+1 \og p n. We can cover the set B"' fc e Z 2 by disjoint 

with side length r n k := [2 k log p_1 n]. 



3.2 



discrete squares T rn k (x,x+l), see Section ', 

We need at most ci2~ fc (log 2_p n)n squa res. Exceedance probability over any sin- 
gle square can be estimated by Lemma 3.4 by c 2 u~ 1 e~ u "/ 2 . For the exceedance 
probability over the set M' 7 ['(A) c U^Lj log2 A ^'n k we obtain the estimate 



max Zi j > u r 



< c-iu^e^l 2 -nlog 2 ^; 



OO 

E 



1 

2*' 



-< C4 x; 

fc=[log 2 A] fc=[log 2 A] 

The right-hand side goes to as A — > 00. The proof is complete. □ 

3.6. Proo f of Prop osition | 1.8[ We assume that we are in the setting of Sec- 
Let 7 = — p)/p. Note that the X^s take values 7 and — 7 _1 . First 



1.5.2 



tion 

we will show that 
(74) I(s) 



'2k 



00 

E 4fc(2fc - 1) 



,2fc-l 



,l-2fc 



00 

E 

k= 



as 



2k+l 



a 

t= 2 



Taking into account ( |20| and solving ip'(t) 

log + 7s) 

for s £ (— 7~ 1 ,7). For s in this range, 
(75) 1(b) = st- <p(t) =p(l + 7s) log (l - 



^ 4fc(2fc 

s, we have 

log (l - - 
V 7 



+ 1) 



n (T»-7»). 



7s 



(l-p)(l->g(l-^). 



For s outside the interval (—7 ,7) we have I(s) = +00. By Taylor's expansion: 



log (l + 7 s ) 



OO 

k=l 



7 



fc=i 



Inserting this into ( |75| ) we obtain (74). If p e (1/2, 1), then < 7 < 1 and hence, 
it follows from (74) that all coefficients in the Taylor expansion of I(s) — (s 2 /2) are 
non-negative (and in fact, the coefficient of s 3 is strictly positive). It follows that 
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3.1 



this 



J(s) > s 2 /2 for all s > 0. This implies that ([34]) holds. By Proposition 
implies Q . Together with the Taylor expansion in ( 20 ) this shows that we are in 
the super logarithmic case with q = 3. 

4. Proof in the logarithmic case 

Our aim in this section is to prove Theorems |1.3| |1.4| |1.5| Assume that con- 
ditions Q, (13), (14) hold. Fix r £ R and define the normalizing sequence 
u n = u n (r) > by 

(76) u 2 n = 2m* (log 7i + t). 

Our aim is to compute the limit of P[M„ < u n ], as n — > oo. 



4.1. Dual conditions. First of all, we need to replace conditions (13) and (14) by 
their Legendre-Fenchel conjugates. We will assume that there is s* > such that 



(77) 

and, additionally, for every e > 0, 
(78) ini' 



1(8*) 



1 



< 1 



o<s<s,-e s 2 /2 



and inf 



I( S ) 



> 



>s*+e s 2 /2 

Proposition 4.1. Suppose that Q holds. Then, conditions (13) and (14) imply 
conditions (77) and (78) and vice versa. Furthermore, if these conditions hold, 
then we have (/?(£*) = /(s*) = s*f*/2 and 



(79) 



3* = if/ (i^) = i^m*, t m — /'(s*) = 



Proof. Assume that (13) and (14) hold. Define = (p'(t*). We will show that (77) 
and (78) hold. By Legendre-Fenchel duality, I' is the inverse function of ip' and 
vice versa. Hence, t„ = I'(s*). The point t* is the unique maximum of the function 
py| by (13) and ( |14[ ). The derivative of this function vanishes at t„ and hence, 
y'(i*)f* = 2</?(i*). In view of (13) this implies that s* = t*m*. Since the maximum 
of — y?(t) is attained at t — t*, we have, see (29), 



I(s*) = <p'(t*)t* — <p(t*) = (p(t*) 



m*tt 



2 



2 2 2m* 

The inverse function of ip' is I'. Taking the derivative we obtain I" (s*)ip" (t*) = 1. 
This proves (Pffb and ([791. 



We will now show that condition (78) is fulfilled. Fix e > 0. Denote by S^u) 
the set {s > : \s — u\ > e}. By ( |l4"| , for every <5 > we there exists c = c(5) < 1 
such that (p(t) < cm^ for all t £ Ss(t„). Then, for every s £ S^s*) and every 
5 > 0, 

(80) /(s) = sup(si - tp(t)) > sup (st - tp(t)) > sup 
t>o tes 5 (t*) tes s (t*) 



^-c(i)m,y 



The supremum of st — c(S)m* \ is attained at t = e( .^ m 
check that t £ Sg(t*). Recall that \s — s*| > e. It follows that 



However, we have to 



t-U = 



c(5)m, 



-t. 



s — c(S)s* 
c(8)m le 



>S, 
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where the last inequality holds if 8 = 5(e) > is sufficiently small. (Note that 

2 

lim^o c(5) = 1). In this case, t £ Sg(t*). It follows from ( 80| that I(s) > 2c (S)m 
for all s G 5 e (s*). This proves j7§. The proof that Q and ([78) imply @ 
and (14) is analogous, by the Legendre-Fenchel duality. □ 



4.2. Individual probability. In the sequel, we assume that conditions Q, (77), 
(78 1 hold. In this section we compute asymptotically the exceedance probability for 
the value attained by the random field Zi j at some individual point G I. We 
focus here on intervals whose length is close to the optimal length d* logn, where 



(81) 



1 



1 



2m* 



ip(U) J(s«) 



It turns out that the exceedance probability remains the same, up to a constant 
factor, if we allow fluctuations of the interval length of order 0(y/\ogn) and fluc- 
tuations of the threshold of order 0(u^ 1 ). 

Lemma 4.2. Assume that Xi is non-lattice. Let l n G N be any sequence such that 
l n = d* log n + a\/\ogn + o(\/log n), for some a£R, as n — » oo. Fix s G R. Then, 
as n — > oo, 



:= 



Si 



s 

?/,, 



ny/log n 



Here, a* = y/(p"(t,) and /3* 2 = ^-(^r - ^-). 
Proof. We are going to apply Theorem 



/'(a) = = s*/m* by Proposition 
Theorem [231 



4.1 



2.2 



with a := lim„_ 



= s*. Note that 



and ct (a) = <p"{U) = crj. We obtain, by 



(82) 



Pn(s) 



2(7* 7T log i 



: exp 



Next we develop the term under the sign of exponential in (82 ) into a Taylor series. 
Consider the function J(v) = vl(l/y/v), v > 0. By assumptions (77), (78) it has a 
unique minimum at u* := s~ 2 . The first two derivatives of J are given by 

(83) rw = /(') ' /■('), A») = 3^''(i)-j^'(-U 



For the values of J and its derivatives at v = i>* we obtain 

(84) J{v.) = ^-, Av*) = o, J"M = s ^a-^-)>o. 



Now consider 



In 



s \2 



d* log n + aylog n + o(yTogn) 
2m* log n + o( v / log ra) 



a + o(l) 
2m* V / I°g n 
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Expanding J into a Taylor series at v — i>* , we obtain 



J(v n )(u n 



s 



1 



2m, 
log n - 



1 a 2 

2 4m 2 log n 

2 m* 



J" (2m* (log n + r) - 2s) + o(l) 
+ o(l). 



To complete the proof insert this into ( 82 1 . 



□ 



4.3. Local probability. Next we compute the exceedance probability over a dis- 
crete square in the space of intervals. Recall from Section |3.2| that for an interval 
(x,y) G I of length I := y — x and B £ N we define Ts(x,y) to be the set of all 
intervals £ I such that 

x — B < i < x and y < j < y + B. 

The set Tb{x, y) is a discrete square with side length B in I C Z 2 . Its right bottom 
point is the "base interval" (x, y) which is contained in all other intervals belonging 
to T B (x,y). 

Lemma 4.3. Assume that Xi is non-lattice. Fix B £ N and let l n £ N be a 
sequence such that l n — d* logn + a^/log n + o(\/\og n), for some a £ R. Write 
TT rl = Ts(x, x + l n ). Then, as n — > oo , 



(85) 



max Zi j > Un 



F„(0)-{l + ^ 2 (B)}, 



where P n (0) is as in Lemma 4-2 and the function : N — > (0, oo) is defined by 

J,S k -kip(t,) 



(86) 



H* (B) = E 



max e 

k=0....,B-l 



Proof. The proof follows the same idea as the proof of Lemma [3T3| but the incremen- 
tal process will be approximated by a discrete-time random walk rather than by a 
Brownian motion. Let and be independent random walks defined as in the 



proof of Lemma 3.3 Define a random variable V n by Z XjX +i n — u n — u~V n . Every 
interval from T„ has the form (x — k\, x + l n + fo) for some integers < k\ , k 2 < B, 
hence 



Or. 



max 

0<fei,fe 2 <B 



y/l n + k\ + k 2 

Conditioning on V n — s and integrating over s, we obtain 



> Mr. 



(87) 



poo 

Qn=Pn(0)+ / G n {s)dfX n (s), 

Jo 



where fi n is the probability distribution of V n and G n is a non-increasing function 
defined by 



(88) G n (s) = P 



max 

0<fei,fe 2 <J 



(Sj£ + S S - Un(Vln+h+k 2 - y/Q - ^) 



> 



By Lemma 4.2 for every s > 0, 
(89) 



hm = Um P »« 



Pn(0) 



Pn(0) 
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Let s n be any sequence converging to s > 0. We compute lirrin-^oo G n (s n ). Let 
f n (ki, k 2 ; s n ) be a function given by 



fn{ki,k 2 ;s n ) = u n (y/l n + k 1 + k 2 - yfQ) 



Recall that u n ~ y/2m* logn and l n ~ c?* logn, asn-> oo. An elementary calculus 
shows that 

lim /„(fci,fe 2 ;s„) = -<f (fa + k 2 ) + — ■ 
Since the a.s. convergence implies the distributional convergence, we obtain that 

max (S^ +S?J - f n (h,h;s n )) — ^ 2 - — , 

where U\, U 2 are two independent copies of the random variable 

U := max (t*5fc — k<p(t*)). 

k=0,...,B-V 

It follows that for all but countably many s > 0, and all sequences s n — > s, 



(90) 



lim G n (s n ) 



U 1 + U 2 > 



Assuming for a moment that interchanging the limit and the integral is justified, 

a ds 



we obtain from ( 89 ) and ( 90 ) that 



(91) lim / G n (s) 



Pn(0) 



U 1 +U 2 > 



e». — = (Ee 

TO* 



U\2 



Inserting this into (87) completes the proof of Lemma 4.3 



The first equality in (911 will be justified using Lemma 3.5 To verify its last 
condition we have to obtain uniform estimates on G n and see Remark |3.6| 
Continuing (88) and recalling that \fl n ~ u„/s*, we obtain that for all large n, and 
all s > 0, 



(1), c (2)s SVinl 



G n (s)<P max (S£ J + Sg) > 
la<k 1 ,k 2 <B Kl K2 



< 



max > —— 

0<k<B K 3s* 



By Lemma 2.4 and (77) we obtain that for some constants c\,c 2 > 0, all large n 
and all s > 0, 

B-i 

(92) G„(s) < 2 ex P { - fc/ (^fc) } ^ c i e " C2s2 - 



Now we bound /x n ([0, s)). Let first s <E u^]. Using Lemma 2.4 and (77) we 

obtain that 



Sin s 

—= > u n 



< exp < — 



Let now s € [0,u*/2]. Using Theorem 2.3 and ((77 1 we obtain 



Sl n ^ 

—7= > U n 



< Cu n 1 exp { - — (u n - — ) }• 
I 2m* V u n > > 



For 5 > u n we can estimate the probability by 1. Combining all cases we obtain 
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Together with Lemma 4.2 this implies that /Lt„([0, s]) < Ce s / m * P n (0) for all large 
n£N and s > 0. Conditions of Remark 13. Glare thus verified. □ 



4.4. Estimating the double sum. Given real numbers A\ < A 2 define l n 



d* \ogn + Aiy/logn and Z 
prove the following result. 



c?* logn + A2\/log n. The aim of this section is to 



Lemma 4.4. Assume that X\ is non-lattice. Let w n — > oo be any integer sequence 
such that w n = 0(\ogn). For z € Z let l n {z) be the set of all intervals (i,j) € I 
such that z < i < z + w n and j — i € [l~ , Z+]. Then, as n — > oo, 

rA 2 



(93) 

where 0(a 
(94) 



max Zi j > u r , 

(*,i)eJ»(*) 



Q(a)da, 



A 1 



2v/7TCT» 6 



2 , a£R, and the constant is given by 



H , = lim = lim ±-E 



max e 

fc=0,...,B-l 



t,Sk — k<p(t t ) 



e(o,i). 



Proof. The existence of the limit in (94) follows from by taking Yf. = t*Xf. — <p(t*) in 
Lemma 4.16 below. In fact, (94 1 can be also obtained as a byproduct of the double 
sum argument presented below. We prove (93). Without restriction of generality, 
let z = 0. To get rid of the boundary effects we introduce two sequences e n — > oo 
and 5 n — > oo such that e n = o(w n ) and S n — o(y / logn) as n — ¥ oo. Take some 
B 6 N and let q n — 1. Then, with the same notation as in (53), (54), (56), (57), 
( 58 1 , we have the Bonferroni inequality 

S'JB) - S"(B) < Pf max Zi j > u r , 



(95) 



< S n (B). 



The statement of Lemma |4.4| follows by letting n — > oo and then Z? — > oo in ( 95 ) 

□ 



and applying Lemmas |4.5| |4.6[ |4.9| which we will prove below. 

Recall that w n — > oo is an integer sequence such that w„ — O(logn) as n 



Lemma 4.5. Let S n (B) be defined as in (56) with q n = 1. We have 

rA 2 



(96) 



lim lim sup nw n S n (B) < e T 



<d(a)da. 



Proof. Since the probability in the right-hand side of ( 56 ) does not depend on x, 
we have 

S n {B) = Wn + ° {Wn) ^ 



B 



iec n (B) 



max Zi j > u n 

(i,j')6Ta(0,0 



where C n (B) = B7L D [Z~ — 5 n >Z„ + <5 n ]. The idea is to apply to each probabil- 
ity Lemma |4.3| and replace Riemann sums by Riemann integrals. Introduce the 
function 



^n.B(a) = n x/log n ■ 



max 

(i,j')eT B (0,i„ 



a e 



where Z„.s(a) = max{Z € BZ : I < d*logn + aylogn}. The function nj B(a) is 
locally constant and its constancy intervals have length B/y'log n. It follows that 



(97) 



S n (B) < 



w n + o(w„) 
B 2 n 



A 2 + - 



Jn.B 



(a)da. 
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For every fixed a G R, the sequence l n = l n ,B{a) satisfies the assumption of 
Lemma [473} By Lemmas |4.3| and |4.2| for every a G R, 



./ml Aa 2 „ 
lim e n , B {a) = e- r e B (a), Q B {a) = ^=^ e -— (1 + H*(B)). 
n->oo Z\/ 7ra '* 

We also need an estimate for # nj _B(a) which is uniform in a. Assume that a G [— c, c], 
for some c > 0. For every interval (i, j) G T B (0, l n ,B{ a )) 01 length Z we have, by 
Theorem [Z3| and fff| , 

P[Z tJ > tin] < Cu-' exp { - Il(^) } < -£= exp { - A} < — 

Since Tb(0, Z n ,s(a)) consists of intervals, we obtain that 9 n ,B(a>) < C for all 
a G [— c, c], where C does not depend on n and a. Taking the limit as n — > oo 
in (97) and applying the dominated convergence theorem, we obtain 

<-A 2 



(98) 



limsup nw^ S n (B) < e T B 2 9fl(a)(io 



This holds for every Bet We let B -> oo. The limit if* := lims^ B^H^B) G 
(0, oo) exists by (94). Hence, limB-^ B~ 2 <ds(a) = 8(a) uniformly in a G R, where 
0(a) is defined as in Lemma 4.4 To complete the proof let B — > oo in (|98|. □ 



Lemma 4.6. Le£ S' n (B) be defined as in (57) with q n — 1. We have 



(99) 



lim liminf nw n S n (B) >e T 

B— >-oo n— ^oo 



8(a)da. 



Proof. Analogous to the proof of Lemma |4.5 



□ 



Remark 4.7. It follows from (96) and (99) that in both equations we can replace 
inequality by equality. 

The next lemma is needed to estimate the "double sum" S„(B). It provides 
an estimate for the correlation between exceedance events over different intervals. 
Consider two intervals K\ = (ii,ji) G I and K 2 = (12,32) G I such that fci := 
ji — ii G [irti'nl an< ^ ^ 2 : ~ -? 2 — «2 G ^n]- Let K be the intersection of K\ 
and i^2- Denote by k G No the cardinality of K. Assume that ki < k 2 and let 
A = A(K 1 ,K 2 ) — k 2 ~ k. 

Lemma 4.8. There exist C\,C 2 > not depending on n, r such that for all 
intervals K\ and K 2 as above and all |r| < \/\ogn, 

(100) 



— — e 
nVlog n 



C 2 A(Ki,K 2 ) 



Proof. Fix e > 0. The event {Zi 1 j 1 > u n ,Zi 2 j 2 > u„} is contained in the event 
E x U(E 2 nE 3 ), where 



£2 = {Sid > Vhu n }, 



VkA^ 

Ur, > 



E3 = {S K2 \K > [\fk~2- ^/k)u n - e 



VkA- 
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We estimate P[.E7i]. Let first u n > 2s* k. Using Lemma 2.4 and (78) we obtain that 
there is 6 > such that 



P[JSi] < cxp| - kl(^ 



\fk 



}<cxp{-I±^ + 2 £ A)}. 



Now let u n < 2s*fc. We have, by Theorem 2.3 and (77), 

P[ El ] < C„-> exp { - kl(^f^-) } < ^ «p { 
Combining both cases we obtain that 



:>£ 1 A\^ . C r {uj + 2eA) | 

2m* i 



(101) 



P[£i] < 



Ce 



n-y/log n 



-eA /m* 



We estimate P[-E 2 ]- By Theorem 2.3 and (77), 



(102) F[E 2 }<C U - 1 exp{-kl(^)} 



C 



-xp{-> }< 
L 2m* J 



n^log n 



We estimate P[-Ea]. We have, using that k 2 ,k < l„, 



"5a 
"A 



> 



ffa + Vk 



< 



5a "n 

A 2\//+ 

^ V l n 



We can choose e > so small that — £— > e. It follows by Lemma 
(103) 



2.4 



that 



¥[E 3 ] < P[5 A > eA] < e~ A/ ( £ 



Here, J(e) > 0. The probability on the left-hand side of ( 100 1 is not larger than 
P[_Ei] + P[i? 2 ]P[i?3] since the events E 2 and E 3 are independent. Combining (101), 
(102), (103) we obtain the required estimate. □ 



Lemma 4.9. Let S'^{B) be defined as in (58) with q n = l. We have 

lim lim sup nw~ 1 5" ( B ) = . 

Proof. Introduce the finite set V n = Z 2 n([0, w n ] x /+]). Take some fixed BeN. 
Let (x,y) € BZ 2 , x < y, be some interval. The discrete square Tb(x, y) consists of 
B 2 intervals. We estimate the exceedance probability over Tb(x, y) by the sum of 
the exceedance probabilities over these intervals. It follows from (]58|) that 



where the sum is taken over all v± € V„ and v 2 G V„ such that Vi <*> v 2 . Applying 

C 



Lemmas 4.8 and 3.15 we get 
C 



n^Iogn 



e -c 2 A(v 1 ,v 2 ) < 



nVlog n 



E 



, — C4 1/1 — 1>2 



Here, 



is the sup- norm. Applying to the right-hand side Lemma 3.14 and noting 

□ 



that |V„| < C\w n Vlog n we arrive at the required statement. 
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4.5. Global probability. Given A\ < A 2 recall that l~ = logn + Ai-^/logn and 
Z+ = rf* logn + A 2 y / \ogn. Denote by l n ( Ai,A 2 ) the set of all intervals £ I„ 
with length I :— j — i £ [1^,1+]. Theorem 1.5 states that 

(104) lim pf max Z, , < u n ] = exp ( - e~ T / 6 

n^-oo L( lJ )eI„( J 4i,A 2 ) ' J J L 



(a)da^. 



The proof of (104) goes as follows. Let w n = [3d* log n]. We decompose the set 
I n (Ai,A 2 ) into ~ n/w n sets of the form I n (z), z £ w„Z. The exceedance proba- 
bility over any of these sets is asymptotically equivalent to e~ T (Jj^ 2 Q(a)da)w n /n 
by Lemma 4.4 Also, the exceedance event over I n (z) is independent of all other 
exceedance events except for J„(z ± w n ). Justifying the use of the Poisson limit 
theorem we obtain (104). The proof, up to trivial changes, is the same as in Sec- 
tion [320 



4.6. Non-optimal lengths. In this section we complete the proof of Theorem |1.3| 

For A > write l~ = d* logn — Ay/\ogn and = d* logn + A^Jlogn. Denote by 
M n (A) = I n \I n (— A, A) the set of all intervals £ I n whose length I := j — i 

satisfies I ^ [l~,l+]. The aim of this section is to show that the contribution of 
these non-optimal lengths to M n is negligible, if A is large. More precisely, we will 
show that 



(105) 



lim lim sup I 

A >co n—too 



max Za j > u 



= 0. 



Combined with Theorem 1.5 proved above this yields Theorem 1.3 We will cover 
the set M n (A) by three sets B^,B",B^' (depending on some further parameters) 
which will be considered separately in the next three lemmas. In this section 
we don't need the non-lattice assumption. First we consider intervals which are 
sufficiently small but not close to the optimal length d* logn. 

Lemma 4.10. Let < e < d* be fixed. For S > denote by M' n = M' n (e,S) 
the set of all intervals £ I n whose length I = j — i satisfies I < n s and 

\l — dst lognj > elogn. Then, we can choose 6 > so small that 



lim P 

n— f 00 



max Zi j > u,- 



0. 



Proof. Let I £ N be such that I < n 5 and \l — d* lognj > elogn. Then, we can find 



an E\ > (depending on e, but not on S,Ln) such that |^ — s»| > £\. By (78) 
there are 8\ , 8 2 > such that for all large n and all / as above, 



WW (u„M) 2 /2 2 l m * V 2 



U 



Using Lemma |2.4| we obtain that for all £ 
probability can be estimated as follows: 



F[Z itj > u n ] < e -"(«»M) < n 



% , the individual exceedance 

-(l+5 2 ) 



Choose any d £ (0,5 2 ). Since the number of intervals in 
statement of the lemma follows. 



is at most n + , the 
□ 



Next we consider intervals whose length is close to being optimal. 
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Lemma 4.11. For e > and A > let B" = B"(^4) be the set of all intervals 
E I n whose length I = j — i satisfies (d* — e)logn < I < l~ or 1+ < I < 
(d* + e) logn. Then, we can choose e > so small that 



lim limsupl 

A ^OO n ^QQ 



max 



> U r , 



0. 



Proof. It follows from (84) that we can choose e,5 > so small that for all v € 

(u* e), 

1 



(106) 



J(«)> 



2m, 



+ 5{v - v*y 



Here, = 2 . For k E No let B^ fe be the set of all intervals E I n with length 
I satisfying 

d„ log n + 2 k ^\ogn < I < log n + 2 fc+1 ^/^ogn. 



It follows from (106) that for every € 

„,2 



tt_n 



= U„J 



I 



> 



2m* 



d*logn + 2 fe Vlog 



rn - 



> logn + ci2 2fe . 



By Theorem 2.3 we obtain 
J St 



< 



C3 



-C4 2 



n\/log n 



Let B'^ be the set of all intervals E I n such that 2+ < i < (d* + e) logn. 
The number of intervals in B" k is at most c^2 k ny/\iSgn. For the exceedance prob- 
ability over the set B" , (A) we obtain the estimate 



max 



^ u r. 



00 



< c 6 2^ 2 e 

fc=[log 2 A] 



fc -c 4 2 2 



The right-hand side goes to as A — > 00. Exceedance probability over the set 
B" _(A) consisting of all intervals E I n with length satisfying {d* — e) \ogn < 
I < l~ can be estimated analogously. □ 

Lemma 4.12. Let 5 > be arbitrary. Let B"' be the set of all intervals E I n 
whose length I = j — i satisfies I > log 1+S n. Then, 



lim I 

n— f 00 



max Zi j > u n 

(*j)eB^' 



0. 



Proof. For fc G N such that 2 k > logn let B^' fc be the set of all intervals E I„ 
with length 2 satisfying 2 k < I < 2 k+1 . W e can cover this set by discrete squares 

where = [2 fc /logrt] and x,l E r^Z. 



4.3 



of the form T rk (x,x + I), see Section 

We need at most c2~ k n\og 2 n such squares. The exceedance probability over each 
such square can be estimated using the same method as in Lemma |3.4| For every 
e > there is rj > such that I(y) > (1 — e)y 2 /2 for all y E [0,7/]. Rec all t hat 
I > log 1+s n and hence, u n /\fl < rj for all sufficiently large n. By Lemma 2.4 for 
every s E [0, u 2 ], 

5', 



.00 := 



>u n 

VI u r, 



< exp < — 



< e -exp{-(i^«}. 
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This inequality continues to hold for s > u„, since in this case the right-hand side 
is greater than 1. Arguing in the same way as in the proof of Lemma |3.4[ but 
replacing ( 49 ) by the above inequality, we obtain 

max Z. 



> u. 



< Cexp 



For the exceedance probability over the set m '" 



{_(i-«K}< 0w -(i-ho, 



max Zij > u ri 



ii . k 



< C2 



obtain 



— h — f - 



To complete the proof, take the sum over all k £ N. 



□ 



4.7. Tightness in the l attic e case. Now we allow the distribution of X\ to be 
lattice and prove Theorem |l.4| The tightness of the sequence M 2 — 2m* log n follows 
from Lemmas 14.131 and 14.151 below. We use the same notation as in Section 14.61 



Namely, for A > we write l n = d* log n — A^/\og n and Z+ = d* log n + Ay/log n. 
Let I n (A) be the set of all intervals € I n with length I = j — i satisfying 

I G [1^, It]- Writc M n(A) = l n \l n (A). Let u„ = u„(t) > be defined by 

(107) u 2 n {r) = 2m,(logn + T), rel. 

Lemma 4.13. For every e > we can find t = r(e) such that P[M„ > u„(r)] < e 
for all large n. 



Proof. Fix e > 0. By ( 105 ) we can find A = A{e) such that for large n, 

e 



(108) 



> 1tn(0) 



< 



-(i,j)6B„(A) 

We estimate the exceedance probability over I n (A). Let t > 0. For every interval 
€ I„(A) with length / 6 [C'C] we have, by Theorem 2.3 and (78 1, 

.(r^C^expj-Z/Q)} - ° 



> U r , 



< 



■ exp 



I 2m* J 



Ce~ 



Vlogn ^ L 2m* J n^log n 

Since the number of elements in I„(j4) is at most 4An^/log n, we obtain that there 
is C\ depending only on A such that 



(109) 



Zi,j > Un(r) 



We can choose r > so large that C\e~ T < e/2. To complete the proof com- 
bine ( [Top ) and ( |T09"1 ). □ 

We now give a lower estimate for the exceedance probability. Let w n = [3d* log n] . 



Take A = 1. Define J n (z) as in Lemma 4.4 



Lemma 4.14. There is a constant C such that for all zeZ, Zarge neN, and 
|t| < Vlogn, 

Ce" 7 " 



> 



n/ io„ ' 



Proof. Without restriction of generality let z = 0. Take some BeN. Let J n {B) 
be a two-dimensional discrete grid with mesh size B defined by 

J n {B) = Bi? n j„(o) = {(»,i) e sz 2 : i e [o,w n ],j ~ i e [C.Cll- 
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Then, by the Bonferroni inequality, 



(110) 



> u n (r) > S' n (B) - S[[(B), 



(j,i)eJi„(o) 

where S' n (B) and S'^{B) are defined by 

S 'n(B) = P ^ > U «( T )1' S 'n( B ) = E P t Z ^ > U n{T),Z V2 > U„(r)], 

and the second sum is taken over all v± ^= V2 with vi — G Sf n {B) and 



«2 = (*2, J2) € J n {B). We estimate S"(B) first. By Lemma 4.8 and Lemma 3.15 



F[2 V1 > u n (r),Z V2 > u n (r)} < e -°K~«»l 

nvlog" 



It follows that 
(111) 



n-y/log n 



e ~ll < 3*JL\ Jn (B)\ 

veBZ,'- 



n^/log n 



Now we estimate S' n (B). By Theorem 6 of [25] (which is a converse inequality to 
Theorem [23]), 



Hence, 
(112) 



P[^j > «n(r)] > C2U- 1 exp { - U(^) } 



> 



lyTog 



S'n(B) > \Jn(B)\ 



We can choose B so large that Ci £„ eBZ 2 C H II < c* 3 /2. Taking qTTop , (m) , 
(112) together and noting that \ J n (B)\ > cw n ylogn yields the statement of the 



lemma. 



□ 



Lemma 4.15. For every e > we can find r — r(e) (sufficiently close to —00) 
such that P[M„ < u n (r)] < e for all large n. 

Proof. Consider the sets J„(z), where z G 2u>„Z. There are at least n/(3w n ) such 
sets contained in E n . The exceedance events over these sets are independent, hence, 

Ce 



< 11- 



P[M„ < u„(t)] < 1 - P max Z itj > u n (r) 

The right hand-side converges to exp{— |Ce~ T }, as n — > 00. It follows that we can 
choose t so close to —00 that for all large n, the right-hand side is smaller than e. 
The proof is complete. □ 

4.8. Pickands-type constant. In this section we provide two alternative ex- 
pressions for the Pickands-type constant H*; see (16). Let lj.,12, ••■ be non- 
degenerate i.i.d. random variables such that ~Ee Yk = 1, k G N. Independently, 
let also y_i, y_2, ... be i.i.d. random variables such that 

(113) P[y_ fc G dy] = e v P[Y k £ dy], k G N. 

Note that Ee~ y - fc = 1, k G N. Define a stochastic process {W k ,k G Z} by Wq = 
and 

(114) w fc = y - 



+ y fe , = y_x + . . . + y_ fe , ken. 
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Lemma 4.16. Let Ljy = maxk=o,...,N Wk, k € N. Then, 
1 



(115) 

Remark 4.17. By 
R-R+, where 



lim — Ee L ' 

AT-foo N 



Vfc G N : W k < 0, VK_ fc < 



the probability on the right-hand side of (115) is equal to 

oo _^ 



> 



«]}. 



(116) fl + = e X p{-^ip[W fc >0]}, R. = cxp { - }_J ^nW-k 
fc=i fe=i 

Remark 4.18. By taking = — <p(t*) we obtain alternative expressions for 
the constant H t defined in (16). It follows from (115) that < -H* < 1. Since 
EYfe < 0, k £ Z, we have limfc-^-i-oo Wk = — oo a.s. by the law of large numbers. 
Therefore, we even have strict inequalities < if* < 1. 



Proof of Lemma 4- 16 Let g(w) = X)at=o ' 
ofEEl, 



i 



exp 



{ 



OO u 

UJ 



Et e[(1 

fc=i 



f Ee LN , \w\ < 1. Then, by p. 207 
)%(=>o]} 



1 00 k 

Y^W ex P { ~ E x (p[ ^ fc > 0] + v[w - k - 0]) }' 



It follows that g(w) ~ R + R-/(l 



k=l 



W 



as m t 1- The sequence Ee ijv is non- 



decreasing. By the Hardy-Littlewood Tauberian theorem, see Corollary 1.7.3 on 
p. 40 in [5], it follows that limTv^oo -^Ee ilv = f?+f?_. In view of Remark 
completes the proof. 



4.17 



this 
□ 



5. Proof in the sublogarithmic case 



Let U n = max{Xi, . . . ,X n }. Take any | < j3 < 



5.1. Proof of Theorem 

min(^, 1). By the assumption of the theorem, we have 

nP[Xi > log^ n] > ne~ loga " " -> oo, n- 
since a/3 < 1. It follows that 

lim P[U n < log' 9 nl = 0. 

n— voo 

The proof will be complete after we have shown that 
(117) lim P [M„ (a log n,n) < log' 3 n] = 1. 



Recall the definition of 7 in (29). Since 7 is a convex function we can find sq > 
such that J(s) > 3/a for all s > s . For every interval G I n with length 

l=2—i such that a log n < I < Sq 2 log 2 ^ n we have, by Lemma 



2.4 



P[^, > lo/ „] < exp { - If(^) } < exp { - I } < ^ 

Since 7(s) ~ |- as s | 0, we can find c > such that J(s) > cs 2 for all s G [0, so]. 
It follows that for every interval G I„ of length I > s^ 2 log 2 ^ n, 

F[Z i>:j > log' 3 n] < exp { - ui^^j } < exp{-clog 2 ' 3 n} < i 
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where we have used that /3 > \. Since the number of intervals in I„ is at most n 2 
it follows that (117) holds. 



1.7 



5.2. Proof of Theorem 

that a < 2). By assumption (19) we have 

'logn\ 1 / a 



Choose v € (0, 1) such that 2 1_ 



> 1 (recall 



nP 



Xl>V \DJ \ = ne M-{v a + o(l))\ogn} 



It follows that the maximum U„ 



lim 



maxj^!, . . . , X n } satisfies 
flogn\ l / a ~ 



U n <v( 



V D 



= 0. 



In view of Theorem |1.6| the proof of Theorem |1.7| will be complete after we have 
shown that 

, \ogn\ 1 / c 



(118) 



lim 



M„(2,logn)> V (^| n ) 



= 0. 



Assume first that a = 1. Then, condition (19) implies that tp(t) = logEe 1 is 
finite for t G [0,D), and equal to +oo for t > D. This implies that 

I(s) ~ Ds as s — > oo. 



Consider now the case a £ (1,2). By Kasahara's theorem [51 p. 253], condition (19) 
is equivalent to ip(t) - Gfr 3 as t -> oo where £ + i = 1 and G 1 " a /3" a (l — j3) = D. 
For the Legendre-Fenchel conjugate, one obtains [5J p. 48] 

I(s) ~ Z?s Q as s — > oo. 

Hence, both for a = 1 and for a E (1,2) we have I(s) > vDs a for large s. By 
Lemma 2.4 for every interval € I n of length 2 < I < logn, we have, for large 

71, 



lognx V« 



< 



exp 



{ 



_ u(JL( \ogn 



1/a 



| < exp | 



Recall that 2 1 " 



2 V 



1+Q > 1. Since the number of intervals in 



-2 1 -%v 1+a logn}. 
I„ with length not 



exceeding logn is at most nlogn, we obtain (118). 
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